Posted to dev@tika.apache.org by "Torsten Krah (Created) (JIRA)" <ji...@apache.org> on 2011/11/26 17:09:40 UTC

[jira] [Created] (TIKA-792) NoSuchMethodException "CTMarkupImpl.<init>(org.apache.xmlbeans.SchemaType, boolean)" processing a OOXML document

NoSuchMethodException "CTMarkupImpl.<init>(org.apache.xmlbeans.SchemaType, boolean)" processing a OOXML document
----------------------------------------------------------------------------------------------------------------

                 Key: TIKA-792
                 URL: https://issues.apache.org/jira/browse/TIKA-792
             Project: Tika
          Issue Type: Bug
          Components: parser
    Affects Versions: 1.0
         Environment: Linux, JDK 1.6, Jetty 8.x, Tomcat 6.x
            Reporter: Torsten Krah


Parsing some OOXML documents, this stacktrace is logged many times:

java.lang.NoSuchMethodException: org.openxmlformats.schemas.wordprocessingml.x2006.main.impl.CTMarkupImpl.<init>(org.apache.xmlbeans.SchemaType, boolean)
	at java.lang.Class.getConstructor0(Class.java:2723)
	at java.lang.Class.getDeclaredConstructor(Class.java:2002)
	at org.apache.xmlbeans.impl.schema.SchemaTypeImpl.getJavaImplConstructor2(SchemaTypeImpl.java:1749)
	at org.apache.xmlbeans.impl.schema.SchemaTypeImpl.createUnattachedSubclass(SchemaTypeImpl.java:1886)
	at org.apache.xmlbeans.impl.schema.SchemaTypeImpl.createUnattachedNode(SchemaTypeImpl.java:1875)
	at org.apache.xmlbeans.impl.schema.SchemaTypeImpl.createElementType(SchemaTypeImpl.java:1021)
	at org.apache.xmlbeans.impl.values.XmlObjectBase.create_element_user(XmlObjectBase.java:893)
	at org.apache.xmlbeans.impl.store.Xobj.getUser(Xobj.java:1657)
	at org.apache.xmlbeans.impl.store.Cur.getUser(Cur.java:2654)
	at org.apache.xmlbeans.impl.store.Cur.getObject(Cur.java:2647)
	at org.apache.xmlbeans.impl.store.Cursor._getObject(Cursor.java:995)
	at org.apache.xmlbeans.impl.store.Cursor.getObject(Cursor.java:2904)
	at org.apache.poi.xwpf.usermodel.XWPFParagraph.<init>(XWPFParagraph.java:83)
	at org.apache.poi.xwpf.usermodel.XWPFDocument.onDocumentRead(XWPFDocument.java:145)
	at org.apache.poi.POIXMLDocument.load(POIXMLDocument.java:159)
	at org.apache.poi.xwpf.usermodel.XWPFDocument.<init>(XWPFDocument.java:115)
	at org.apache.poi.xwpf.extractor.XWPFWordExtractor.<init>(XWPFWordExtractor.java:53)
	at org.apache.poi.extractor.ExtractorFactory.createExtractor(ExtractorFactory.java:180)
	at org.apache.tika.parser.microsoft.ooxml.OOXMLExtractorFactory.parse(OOXMLExtractorFactory.java:63)
	at org.apache.tika.parser.microsoft.ooxml.OOXMLParser.parse(OOXMLParser.java:69)
	at org.apache.tika.parser.CompositeParser.parse(CompositeParser.java:242)
	at org.apache.tika.parser.CompositeParser.parse(CompositeParser.java:242)
	at org.apache.tika.parser.AutoDetectParser.parse(AutoDetectParser.java:120)


Looking at the POI code, Java is right here: there is no constructor taking a SchemaType and a boolean, only one taking a SchemaType.
My guess is that this one was missed during the upgrade to POI beta4, but that is only a guess. Either way, it needs a fix :-).
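
To illustrate how this kind of exception arises, here is a minimal sketch using only the JDK. It does not touch XMLBeans or POI. The class names (ConstructorLookupDemo, SingleArgImpl) and the helper hasDeclaredConstructor are hypothetical, and String merely stands in for SchemaType. The point is only that Class.getDeclaredConstructor throws NoSuchMethodException when no constructor with exactly the requested parameter types is declared.

```java
import java.lang.reflect.Constructor;

public class ConstructorLookupDemo {

    // Hypothetical stand-in for a generated impl class that declares only a
    // single-argument constructor (String plays the role of SchemaType here).
    static class SingleArgImpl {
        SingleArgImpl(String schemaType) { }
    }

    // Returns true if the class declares a constructor with exactly these
    // parameter types, false if the reflective lookup fails.
    static boolean hasDeclaredConstructor(Class<?> type, Class<?>... parameterTypes) {
        try {
            Constructor<?> ctor = type.getDeclaredConstructor(parameterTypes);
            return ctor != null;
        } catch (NoSuchMethodException e) {
            // Same exception type as at the top of the stack trace above.
            return false;
        }
    }

    public static void main(String[] args) {
        // One-argument lookup matches the declared constructor.
        System.out.println(hasDeclaredConstructor(SingleArgImpl.class, String.class));
        // Two-argument lookup has no match, so the lookup fails.
        System.out.println(hasDeclaredConstructor(SingleArgImpl.class, String.class, boolean.class));
    }
}
```

Running main prints true for the one-argument lookup and false for the two-argument one, which mirrors the (SchemaType) versus (SchemaType, boolean) mismatch described above.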

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira

        

[jira] [Commented] (TIKA-792) NoSuchMethodException "CTMarkupImpl.<init>(org.apache.xmlbeans.SchemaType, boolean)" processing a OOXML document

Posted by "Marek Slama (Commented) (JIRA)" <ji...@apache.org>.
    [ https://issues.apache.org/jira/browse/TIKA-792?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13245404#comment-13245404 ] 

Marek Slama commented on TIKA-792:
----------------------------------

I no longer see this problem now that we have upgraded to Jackrabbit 2.4.0, which uses POI 3.8-beta4.
                

        

[jira] [Commented] (TIKA-792) NoSuchMethodException "CTMarkupImpl.<init>(org.apache.xmlbeans.SchemaType, boolean)" processing a OOXML document

Posted by "Marek Slama (Commented) (JIRA)" <ji...@apache.org>.
    [ https://issues.apache.org/jira/browse/TIKA-792?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13202473#comment-13202473 ] 

Marek Slama commented on TIKA-792:
----------------------------------

We have the same problem. In our case it happens in a Jackrabbit background indexing task, so I cannot easily say which file causes it. We put our files into a Jackrabbit repository. I will try to replace the schemas jar as suggested, but I also have to do that in the Jackrabbit files.
                

        

[jira] [Commented] (TIKA-792) NoSuchMethodException "CTMarkupImpl.<init>(org.apache.xmlbeans.SchemaType, boolean)" processing a OOXML document

Posted by "Eric Pascal (JIRA)" <ji...@apache.org>.
    [ https://issues.apache.org/jira/browse/TIKA-792?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13433145#comment-13433145 ] 

Eric Pascal commented on TIKA-792:
----------------------------------

The problem is still there for me in POI 3.8 final.
                

        

[jira] [Commented] (TIKA-792) NoSuchMethodException "CTMarkupImpl.<init>(org.apache.xmlbeans.SchemaType, boolean)" processing a OOXML document

Posted by "Nick Burch (Commented) (JIRA)" <ji...@apache.org>.
    [ https://issues.apache.org/jira/browse/TIKA-792?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13245436#comment-13245436 ] 

Nick Burch commented on TIKA-792:
---------------------------------

Thanks for the feedback, Marek. As of r1309005 we're now on POI 3.8 Final, so I'll mark this as fixed.
                

        

[jira] [Commented] (TIKA-792) NoSuchMethodException "CTMarkupImpl.<init>(org.apache.xmlbeans.SchemaType, boolean)" processing a OOXML document

Posted by "Nick Burch (Commented) (JIRA)" <ji...@apache.org>.
    [ https://issues.apache.org/jira/browse/TIKA-792?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13189839#comment-13189839 ] 

Nick Burch commented on TIKA-792:
---------------------------------

Are you able to share one of the files that trigger this? The easiest way to add a unit test to POI that gets the missing classes included is with a file that triggers the problem.
                

        

[jira] [Resolved] (TIKA-792) NoSuchMethodException "CTMarkupImpl.<init>(org.apache.xmlbeans.SchemaType, boolean)" processing a OOXML document

Posted by "Nick Burch (Resolved) (JIRA)" <ji...@apache.org>.
     [ https://issues.apache.org/jira/browse/TIKA-792?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Nick Burch resolved TIKA-792.
-----------------------------

       Resolution: Fixed
    Fix Version/s: 1.2
    

        

[jira] [Commented] (TIKA-792) NoSuchMethodException "CTMarkupImpl.<init>(org.apache.xmlbeans.SchemaType, boolean)" processing a OOXML document

Posted by "Torsten Krah (Commented) (JIRA)" <ji...@apache.org>.
    [ https://issues.apache.org/jira/browse/TIKA-792?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13189855#comment-13189855 ] 

Torsten Krah commented on TIKA-792:
-----------------------------------

I'll try to find some document(s), may take a few days.
                

        

[jira] [Commented] (TIKA-792) NoSuchMethodException "CTMarkupImpl.<init>(org.apache.xmlbeans.SchemaType, boolean)" processing a OOXML document

Posted by "Nick Burch (Commented) (JIRA)" <ji...@apache.org>.
    [ https://issues.apache.org/jira/browse/TIKA-792?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13157558#comment-13157558 ] 

Nick Burch commented on TIKA-792:
---------------------------------

Your quick fix is to replace the poi-ooxml-schemas jar with the full ooxml-schemas-1.1 jar. The former is an excerpt containing just the "common" parts.

For a full fix, we need to add a unit test to POI that uses the same method. To decide what's common, the POI build script looks at what is used in the unit tests, so adding a test that uses a method will cause the appropriate parts to be included in the next poi-ooxml-schemas file.
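
When trying the jar swap, it can help to confirm which jar a schema class is actually loaded from, particularly when more than one schemas jar might be on the classpath. The following is a rough sketch using only JDK APIs. The class name JarLocator and the method locate are made up for illustration, and the default class name is simply the one from the stack trace in this issue.

```java
import java.security.CodeSource;

public class JarLocator {

    // Returns a description of where the given class was loaded from, or a
    // short note if the class or its location cannot be determined.
    static String locate(String className) {
        try {
            // Load without initializing, so static initializers are not run.
            Class<?> type = Class.forName(className, false, JarLocator.class.getClassLoader());
            CodeSource source = type.getProtectionDomain().getCodeSource();
            // Classes loaded by the bootstrap loader usually have no code source.
            return source == null
                ? "(no code source, probably a JDK class)"
                : String.valueOf(source.getLocation());
        } catch (ClassNotFoundException e) {
            return "(not on the classpath)";
        }
    }

    public static void main(String[] args) {
        // Default to the class named in the stack trace of this issue.
        String name = args.length > 0
            ? args[0]
            : "org.openxmlformats.schemas.wordprocessingml.x2006.main.impl.CTMarkupImpl";
        System.out.println(name + " -> " + locate(name));
    }
}
```

Run with the same classpath as the application. If the printed location still points at the poi-ooxml-schemas jar after the swap, the replacement has probably not taken effect in that deployment.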
                