You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@tika.apache.org by "Andrzej Bialecki (Created) (JIRA)" <ji...@apache.org> on 2011/12/05 13:37:39 UTC

[jira] [Created] (TIKA-801) ContentHandlerDecorator outputs invalid element

ContentHandlerDecorator outputs invalid element
-----------------------------------------------

                 Key: TIKA-801
                 URL: https://issues.apache.org/jira/browse/TIKA-801
             Project: Tika
          Issue Type: Bug
    Affects Versions: 1.0, 1.1
            Reporter: Andrzej Bialecki 


* Start Tika GUI
* try opening test-outlook.msg (from tika-parsers test resources)
* the following exception is thrown:
{code}
org.apache.tika.exception.TikaException: Unexpected RuntimeException from org.apache.tika.parser.microsoft.OfficeParser@12e14ebc
	at org.apache.tika.parser.CompositeParser.parse(CompositeParser.java:245)
	at org.apache.tika.parser.CompositeParser.parse(CompositeParser.java:243)
	at org.apache.tika.parser.AutoDetectParser.parse(AutoDetectParser.java:120)
	at org.apache.tika.gui.TikaGUI.handleStream(TikaGUI.java:320)
	at org.apache.tika.gui.TikaGUI.openFile(TikaGUI.java:279)
	at org.apache.tika.gui.TikaGUI.actionPerformed(TikaGUI.java:238)
	at javax.swing.AbstractButton.fireActionPerformed(AbstractButton.java:2028)
	at javax.swing.AbstractButton$Handler.actionPerformed(AbstractButton.java:2351)
	at javax.swing.DefaultButtonModel.fireActionPerformed(DefaultButtonModel.java:387)
	at javax.swing.DefaultButtonModel.setPressed(DefaultButtonModel.java:242)
	at javax.swing.AbstractButton.doClick(AbstractButton.java:389)
	at javax.swing.plaf.basic.BasicMenuItemUI.doClick(BasicMenuItemUI.java:809)
	at com.apple.laf.AquaMenuItemUI.doClick(AquaMenuItemUI.java:137)
	at javax.swing.plaf.basic.BasicMenuItemUI$Handler.mouseReleased(BasicMenuItemUI.java:850)
	at java.awt.Component.processMouseEvent(Component.java:6373)
	at javax.swing.JComponent.processMouseEvent(JComponent.java:3267)
	at java.awt.Component.processEvent(Component.java:6138)
	at java.awt.Container.processEvent(Container.java:2085)
	at java.awt.Component.dispatchEventImpl(Component.java:4735)
	at java.awt.Container.dispatchEventImpl(Container.java:2143)
	at java.awt.Component.dispatchEvent(Component.java:4565)
	at java.awt.LightweightDispatcher.retargetMouseEvent(Container.java:4621)
	at java.awt.LightweightDispatcher.processMouseEvent(Container.java:4282)
	at java.awt.LightweightDispatcher.dispatchEvent(Container.java:4212)
	at java.awt.Container.dispatchEventImpl(Container.java:2129)
	at java.awt.Window.dispatchEventImpl(Window.java:2478)
	at java.awt.Component.dispatchEvent(Component.java:4565)
	at java.awt.EventQueue.dispatchEventImpl(EventQueue.java:679)
	at java.awt.EventQueue.access$000(EventQueue.java:85)
	at java.awt.EventQueue$1.run(EventQueue.java:638)
	at java.awt.EventQueue$1.run(EventQueue.java:636)
	at java.security.AccessController.doPrivileged(Native Method)
	at java.security.AccessControlContext$1.doIntersectionPrivilege(AccessControlContext.java:87)
	at java.security.AccessControlContext$1.doIntersectionPrivilege(AccessControlContext.java:98)
	at java.awt.EventQueue$2.run(EventQueue.java:652)
	at java.awt.EventQueue$2.run(EventQueue.java:650)
	at java.security.AccessController.doPrivileged(Native Method)
	at java.security.AccessControlContext$1.doIntersectionPrivilege(AccessControlContext.java:87)
	at java.awt.EventQueue.dispatchEvent(EventQueue.java:649)
	at java.awt.EventDispatchThread.pumpOneEventForFilters(EventDispatchThread.java:296)
	at java.awt.EventDispatchThread.pumpEventsForFilter(EventDispatchThread.java:211)
	at java.awt.EventDispatchThread.pumpEventsForHierarchy(EventDispatchThread.java:201)
	at java.awt.EventDispatchThread.pumpEvents(EventDispatchThread.java:196)
	at java.awt.EventDispatchThread.pumpEvents(EventDispatchThread.java:188)
	at java.awt.EventDispatchThread.run(EventDispatchThread.java:122)
Caused by: java.lang.NullPointerException
	at com.sun.org.apache.xml.internal.serializer.ToHTMLStream.endElement(ToHTMLStream.java:907)
	at com.sun.org.apache.xalan.internal.xsltc.trax.TransformerHandlerImpl.endElement(TransformerHandlerImpl.java:273)
	at org.apache.tika.sax.ContentHandlerDecorator.endElement(ContentHandlerDecorator.java:136)
	at org.apache.tika.gui.TikaGUI$2.endElement(TikaGUI.java:519)
	at org.apache.tika.sax.TeeContentHandler.endElement(TeeContentHandler.java:94)
	at org.apache.tika.sax.ContentHandlerDecorator.endElement(ContentHandlerDecorator.java:136)
	at org.apache.tika.sax.SecureContentHandler.endElement(SecureContentHandler.java:256)
	at org.apache.tika.sax.ContentHandlerDecorator.endElement(ContentHandlerDecorator.java:136)
	at org.apache.tika.sax.ContentHandlerDecorator.endElement(ContentHandlerDecorator.java:136)
	at org.apache.tika.sax.ContentHandlerDecorator.endElement(ContentHandlerDecorator.java:136)
	at org.apache.tika.sax.SafeContentHandler.endElement(SafeContentHandler.java:273)
	at org.apache.tika.sax.XHTMLContentHandler.endDocument(XHTMLContentHandler.java:213)
	at org.apache.tika.parser.microsoft.OfficeParser.parse(OfficeParser.java:159)
	at org.apache.tika.parser.CompositeParser.parse(CompositeParser.java:243)
	... 44 more
{code}

The same file is parsed without any errors when not in GUI mode.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira

        

[jira] [Commented] (TIKA-801) ContentHandlerDecorator outputs invalid element

Posted by "Paul Hill (Commented) (JIRA)" <ji...@apache.org>.
    [ https://issues.apache.org/jira/browse/TIKA-801?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13164877#comment-13164877 ] 

Paul Hill commented on TIKA-801:
--------------------------------

If I am having the same problem, as suggested by Mike McCandless on the user list.  Then it is easy to reproduce.  No attachments required.  Just forward an e-mail 2 or 3 times to your self within Outlook, then copy and past onto your filesystem to create an msg file.  My 1st example was from last year or older, so the latest Outlook is NOT required.

Then drop onto Tika-app 1.0 (but not 0.7, 0.9, 0.10) and you get the following

org.apache.tika.exception.TikaException: Unexpected RuntimeException from org.apache.tika.parser.microsoft.OfficeParser@97de276
	at org.apache.tika.parser.CompositeParser.parse(CompositeParser.java:244)
	at org.apache.tika.parser.CompositeParser.parse(CompositeParser.java:242)
	at org.apache.tika.parser.AutoDetectParser.parse(AutoDetectParser.java:120)
	at org.apache.tika.gui.TikaGUI.handleStream(TikaGUI.java:320)
	at org.apache.tika.gui.TikaGUI.openFile(TikaGUI.java:279)
	at org.apache.tika.gui.ParsingTransferHandler.importFiles(ParsingTransferHandler.java:94)
	at org.apache.tika.gui.ParsingTransferHandler.importData(ParsingTransferHandler.java:77)
[...]
Caused by: java.lang.NullPointerException
	at com.sun.org.apache.xml.internal.serializer.ToHTMLStream.endElement(Unknown Source)
	at com.sun.org.apache.xalan.internal.xsltc.trax.TransformerHandlerImpl.endElement(Unknown Source)
	at org.apache.tika.sax.ContentHandlerDecorator.endElement(ContentHandlerDecorator.java:136)
	at org.apache.tika.gui.TikaGUI$2.endElement(TikaGUI.java:519)
	at org.apache.tika.sax.TeeContentHandler.endElement(TeeContentHandler.java:94)
	at org.apache.tika.sax.ContentHandlerDecorator.endElement(ContentHandlerDecorator.java:136)
	at org.apache.tika.sax.SecureContentHandler.endElement(SecureContentHandler.java:256)
	at org.apache.tika.sax.ContentHandlerDecorator.endElement(ContentHandlerDecorator.java:136)
	at org.apache.tika.sax.ContentHandlerDecorator.endElement(ContentHandlerDecorator.java:136)
	at org.apache.tika.sax.ContentHandlerDecorator.endElement(ContentHandlerDecorator.java:136)
	at org.apache.tika.sax.SafeContentHandler.endElement(SafeContentHandler.java:273)
	at org.apache.tika.sax.XHTMLContentHandler.endDocument(XHTMLContentHandler.java:213)
	at org.apache.tika.parser.microsoft.OfficeParser.parse(OfficeParser.java:178)
	at org.apache.tika.parser.CompositeParser.parse(CompositeParser.java:242)
	... 41 more


                
> ContentHandlerDecorator outputs invalid element
> -----------------------------------------------
>
>                 Key: TIKA-801
>                 URL: https://issues.apache.org/jira/browse/TIKA-801
>             Project: Tika
>          Issue Type: Bug
>    Affects Versions: 1.0, 1.1
>            Reporter: Andrzej Bialecki 
>
> * Start Tika GUI
> * try opening test-outlook.msg (from tika-parsers test resources)
> * the following exception is thrown:
> {code}
> org.apache.tika.exception.TikaException: Unexpected RuntimeException from org.apache.tika.parser.microsoft.OfficeParser@12e14ebc
> 	at org.apache.tika.parser.CompositeParser.parse(CompositeParser.java:245)
> 	at org.apache.tika.parser.CompositeParser.parse(CompositeParser.java:243)
> 	at org.apache.tika.parser.AutoDetectParser.parse(AutoDetectParser.java:120)
> 	at org.apache.tika.gui.TikaGUI.handleStream(TikaGUI.java:320)
> 	at org.apache.tika.gui.TikaGUI.openFile(TikaGUI.java:279)
> 	at org.apache.tika.gui.TikaGUI.actionPerformed(TikaGUI.java:238)
> 	at javax.swing.AbstractButton.fireActionPerformed(AbstractButton.java:2028)
> 	at javax.swing.AbstractButton$Handler.actionPerformed(AbstractButton.java:2351)
> 	at javax.swing.DefaultButtonModel.fireActionPerformed(DefaultButtonModel.java:387)
> 	at javax.swing.DefaultButtonModel.setPressed(DefaultButtonModel.java:242)
> 	at javax.swing.AbstractButton.doClick(AbstractButton.java:389)
> 	at javax.swing.plaf.basic.BasicMenuItemUI.doClick(BasicMenuItemUI.java:809)
> 	at com.apple.laf.AquaMenuItemUI.doClick(AquaMenuItemUI.java:137)
> 	at javax.swing.plaf.basic.BasicMenuItemUI$Handler.mouseReleased(BasicMenuItemUI.java:850)
> 	at java.awt.Component.processMouseEvent(Component.java:6373)
> 	at javax.swing.JComponent.processMouseEvent(JComponent.java:3267)
> 	at java.awt.Component.processEvent(Component.java:6138)
> 	at java.awt.Container.processEvent(Container.java:2085)
> 	at java.awt.Component.dispatchEventImpl(Component.java:4735)
> 	at java.awt.Container.dispatchEventImpl(Container.java:2143)
> 	at java.awt.Component.dispatchEvent(Component.java:4565)
> 	at java.awt.LightweightDispatcher.retargetMouseEvent(Container.java:4621)
> 	at java.awt.LightweightDispatcher.processMouseEvent(Container.java:4282)
> 	at java.awt.LightweightDispatcher.dispatchEvent(Container.java:4212)
> 	at java.awt.Container.dispatchEventImpl(Container.java:2129)
> 	at java.awt.Window.dispatchEventImpl(Window.java:2478)
> 	at java.awt.Component.dispatchEvent(Component.java:4565)
> 	at java.awt.EventQueue.dispatchEventImpl(EventQueue.java:679)
> 	at java.awt.EventQueue.access$000(EventQueue.java:85)
> 	at java.awt.EventQueue$1.run(EventQueue.java:638)
> 	at java.awt.EventQueue$1.run(EventQueue.java:636)
> 	at java.security.AccessController.doPrivileged(Native Method)
> 	at java.security.AccessControlContext$1.doIntersectionPrivilege(AccessControlContext.java:87)
> 	at java.security.AccessControlContext$1.doIntersectionPrivilege(AccessControlContext.java:98)
> 	at java.awt.EventQueue$2.run(EventQueue.java:652)
> 	at java.awt.EventQueue$2.run(EventQueue.java:650)
> 	at java.security.AccessController.doPrivileged(Native Method)
> 	at java.security.AccessControlContext$1.doIntersectionPrivilege(AccessControlContext.java:87)
> 	at java.awt.EventQueue.dispatchEvent(EventQueue.java:649)
> 	at java.awt.EventDispatchThread.pumpOneEventForFilters(EventDispatchThread.java:296)
> 	at java.awt.EventDispatchThread.pumpEventsForFilter(EventDispatchThread.java:211)
> 	at java.awt.EventDispatchThread.pumpEventsForHierarchy(EventDispatchThread.java:201)
> 	at java.awt.EventDispatchThread.pumpEvents(EventDispatchThread.java:196)
> 	at java.awt.EventDispatchThread.pumpEvents(EventDispatchThread.java:188)
> 	at java.awt.EventDispatchThread.run(EventDispatchThread.java:122)
> Caused by: java.lang.NullPointerException
> 	at com.sun.org.apache.xml.internal.serializer.ToHTMLStream.endElement(ToHTMLStream.java:907)
> 	at com.sun.org.apache.xalan.internal.xsltc.trax.TransformerHandlerImpl.endElement(TransformerHandlerImpl.java:273)
> 	at org.apache.tika.sax.ContentHandlerDecorator.endElement(ContentHandlerDecorator.java:136)
> 	at org.apache.tika.gui.TikaGUI$2.endElement(TikaGUI.java:519)
> 	at org.apache.tika.sax.TeeContentHandler.endElement(TeeContentHandler.java:94)
> 	at org.apache.tika.sax.ContentHandlerDecorator.endElement(ContentHandlerDecorator.java:136)
> 	at org.apache.tika.sax.SecureContentHandler.endElement(SecureContentHandler.java:256)
> 	at org.apache.tika.sax.ContentHandlerDecorator.endElement(ContentHandlerDecorator.java:136)
> 	at org.apache.tika.sax.ContentHandlerDecorator.endElement(ContentHandlerDecorator.java:136)
> 	at org.apache.tika.sax.ContentHandlerDecorator.endElement(ContentHandlerDecorator.java:136)
> 	at org.apache.tika.sax.SafeContentHandler.endElement(SafeContentHandler.java:273)
> 	at org.apache.tika.sax.XHTMLContentHandler.endDocument(XHTMLContentHandler.java:213)
> 	at org.apache.tika.parser.microsoft.OfficeParser.parse(OfficeParser.java:159)
> 	at org.apache.tika.parser.CompositeParser.parse(CompositeParser.java:243)
> 	... 44 more
> {code}
> The same file is parsed without any errors when not in GUI mode.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira

        

[jira] [Issue Comment Edited] (TIKA-801) ContentHandlerDecorator outputs invalid element

Posted by "Paul Hill (Issue Comment Edited) (JIRA)" <ji...@apache.org>.
    [ https://issues.apache.org/jira/browse/TIKA-801?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13164879#comment-13164879 ] 

Paul Hill edited comment on TIKA-801 at 12/8/11 12:43 AM:
----------------------------------------------------------

The attached file stack dumps in Tika 1.0 as of 2011-12-05 Tika-app-1.0 Release
                
      was (Author: pahill):
    This files stack dumps in Tika 1.0 as of 2011-12-05 Tika-app-1.0 Release
                  
> ContentHandlerDecorator outputs invalid element
> -----------------------------------------------
>
>                 Key: TIKA-801
>                 URL: https://issues.apache.org/jira/browse/TIKA-801
>             Project: Tika
>          Issue Type: Bug
>    Affects Versions: 1.0, 1.1
>            Reporter: Andrzej Bialecki 
>         Attachments: FW  Testing.msg
>
>
> * Start Tika GUI
> * try opening test-outlook.msg (from tika-parsers test resources)
> * the following exception is thrown:
> {code}
> org.apache.tika.exception.TikaException: Unexpected RuntimeException from org.apache.tika.parser.microsoft.OfficeParser@12e14ebc
> 	at org.apache.tika.parser.CompositeParser.parse(CompositeParser.java:245)
> 	at org.apache.tika.parser.CompositeParser.parse(CompositeParser.java:243)
> 	at org.apache.tika.parser.AutoDetectParser.parse(AutoDetectParser.java:120)
> 	at org.apache.tika.gui.TikaGUI.handleStream(TikaGUI.java:320)
> 	at org.apache.tika.gui.TikaGUI.openFile(TikaGUI.java:279)
> 	at org.apache.tika.gui.TikaGUI.actionPerformed(TikaGUI.java:238)
> 	at javax.swing.AbstractButton.fireActionPerformed(AbstractButton.java:2028)
> 	at javax.swing.AbstractButton$Handler.actionPerformed(AbstractButton.java:2351)
> 	at javax.swing.DefaultButtonModel.fireActionPerformed(DefaultButtonModel.java:387)
> 	at javax.swing.DefaultButtonModel.setPressed(DefaultButtonModel.java:242)
> 	at javax.swing.AbstractButton.doClick(AbstractButton.java:389)
> 	at javax.swing.plaf.basic.BasicMenuItemUI.doClick(BasicMenuItemUI.java:809)
> 	at com.apple.laf.AquaMenuItemUI.doClick(AquaMenuItemUI.java:137)
> 	at javax.swing.plaf.basic.BasicMenuItemUI$Handler.mouseReleased(BasicMenuItemUI.java:850)
> 	at java.awt.Component.processMouseEvent(Component.java:6373)
> 	at javax.swing.JComponent.processMouseEvent(JComponent.java:3267)
> 	at java.awt.Component.processEvent(Component.java:6138)
> 	at java.awt.Container.processEvent(Container.java:2085)
> 	at java.awt.Component.dispatchEventImpl(Component.java:4735)
> 	at java.awt.Container.dispatchEventImpl(Container.java:2143)
> 	at java.awt.Component.dispatchEvent(Component.java:4565)
> 	at java.awt.LightweightDispatcher.retargetMouseEvent(Container.java:4621)
> 	at java.awt.LightweightDispatcher.processMouseEvent(Container.java:4282)
> 	at java.awt.LightweightDispatcher.dispatchEvent(Container.java:4212)
> 	at java.awt.Container.dispatchEventImpl(Container.java:2129)
> 	at java.awt.Window.dispatchEventImpl(Window.java:2478)
> 	at java.awt.Component.dispatchEvent(Component.java:4565)
> 	at java.awt.EventQueue.dispatchEventImpl(EventQueue.java:679)
> 	at java.awt.EventQueue.access$000(EventQueue.java:85)
> 	at java.awt.EventQueue$1.run(EventQueue.java:638)
> 	at java.awt.EventQueue$1.run(EventQueue.java:636)
> 	at java.security.AccessController.doPrivileged(Native Method)
> 	at java.security.AccessControlContext$1.doIntersectionPrivilege(AccessControlContext.java:87)
> 	at java.security.AccessControlContext$1.doIntersectionPrivilege(AccessControlContext.java:98)
> 	at java.awt.EventQueue$2.run(EventQueue.java:652)
> 	at java.awt.EventQueue$2.run(EventQueue.java:650)
> 	at java.security.AccessController.doPrivileged(Native Method)
> 	at java.security.AccessControlContext$1.doIntersectionPrivilege(AccessControlContext.java:87)
> 	at java.awt.EventQueue.dispatchEvent(EventQueue.java:649)
> 	at java.awt.EventDispatchThread.pumpOneEventForFilters(EventDispatchThread.java:296)
> 	at java.awt.EventDispatchThread.pumpEventsForFilter(EventDispatchThread.java:211)
> 	at java.awt.EventDispatchThread.pumpEventsForHierarchy(EventDispatchThread.java:201)
> 	at java.awt.EventDispatchThread.pumpEvents(EventDispatchThread.java:196)
> 	at java.awt.EventDispatchThread.pumpEvents(EventDispatchThread.java:188)
> 	at java.awt.EventDispatchThread.run(EventDispatchThread.java:122)
> Caused by: java.lang.NullPointerException
> 	at com.sun.org.apache.xml.internal.serializer.ToHTMLStream.endElement(ToHTMLStream.java:907)
> 	at com.sun.org.apache.xalan.internal.xsltc.trax.TransformerHandlerImpl.endElement(TransformerHandlerImpl.java:273)
> 	at org.apache.tika.sax.ContentHandlerDecorator.endElement(ContentHandlerDecorator.java:136)
> 	at org.apache.tika.gui.TikaGUI$2.endElement(TikaGUI.java:519)
> 	at org.apache.tika.sax.TeeContentHandler.endElement(TeeContentHandler.java:94)
> 	at org.apache.tika.sax.ContentHandlerDecorator.endElement(ContentHandlerDecorator.java:136)
> 	at org.apache.tika.sax.SecureContentHandler.endElement(SecureContentHandler.java:256)
> 	at org.apache.tika.sax.ContentHandlerDecorator.endElement(ContentHandlerDecorator.java:136)
> 	at org.apache.tika.sax.ContentHandlerDecorator.endElement(ContentHandlerDecorator.java:136)
> 	at org.apache.tika.sax.ContentHandlerDecorator.endElement(ContentHandlerDecorator.java:136)
> 	at org.apache.tika.sax.SafeContentHandler.endElement(SafeContentHandler.java:273)
> 	at org.apache.tika.sax.XHTMLContentHandler.endDocument(XHTMLContentHandler.java:213)
> 	at org.apache.tika.parser.microsoft.OfficeParser.parse(OfficeParser.java:159)
> 	at org.apache.tika.parser.CompositeParser.parse(CompositeParser.java:243)
> 	... 44 more
> {code}
> The same file is parsed without any errors when not in GUI mode.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira

        

[jira] [Assigned] (TIKA-801) ContentHandlerDecorator outputs invalid element

Posted by "Michael McCandless (Assigned) (JIRA)" <ji...@apache.org>.
     [ https://issues.apache.org/jira/browse/TIKA-801?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Michael McCandless reassigned TIKA-801:
---------------------------------------

    Assignee: Michael McCandless
    
> ContentHandlerDecorator outputs invalid element
> -----------------------------------------------
>
>                 Key: TIKA-801
>                 URL: https://issues.apache.org/jira/browse/TIKA-801
>             Project: Tika
>          Issue Type: Bug
>    Affects Versions: 1.0, 1.1
>            Reporter: Andrzej Bialecki 
>            Assignee: Michael McCandless
>         Attachments: FW  Testing.msg
>
>
> * Start Tika GUI
> * try opening test-outlook.msg (from tika-parsers test resources)
> * the following exception is thrown:
> {code}
> org.apache.tika.exception.TikaException: Unexpected RuntimeException from org.apache.tika.parser.microsoft.OfficeParser@12e14ebc
> 	at org.apache.tika.parser.CompositeParser.parse(CompositeParser.java:245)
> 	at org.apache.tika.parser.CompositeParser.parse(CompositeParser.java:243)
> 	at org.apache.tika.parser.AutoDetectParser.parse(AutoDetectParser.java:120)
> 	at org.apache.tika.gui.TikaGUI.handleStream(TikaGUI.java:320)
> 	at org.apache.tika.gui.TikaGUI.openFile(TikaGUI.java:279)
> 	at org.apache.tika.gui.TikaGUI.actionPerformed(TikaGUI.java:238)
> 	at javax.swing.AbstractButton.fireActionPerformed(AbstractButton.java:2028)
> 	at javax.swing.AbstractButton$Handler.actionPerformed(AbstractButton.java:2351)
> 	at javax.swing.DefaultButtonModel.fireActionPerformed(DefaultButtonModel.java:387)
> 	at javax.swing.DefaultButtonModel.setPressed(DefaultButtonModel.java:242)
> 	at javax.swing.AbstractButton.doClick(AbstractButton.java:389)
> 	at javax.swing.plaf.basic.BasicMenuItemUI.doClick(BasicMenuItemUI.java:809)
> 	at com.apple.laf.AquaMenuItemUI.doClick(AquaMenuItemUI.java:137)
> 	at javax.swing.plaf.basic.BasicMenuItemUI$Handler.mouseReleased(BasicMenuItemUI.java:850)
> 	at java.awt.Component.processMouseEvent(Component.java:6373)
> 	at javax.swing.JComponent.processMouseEvent(JComponent.java:3267)
> 	at java.awt.Component.processEvent(Component.java:6138)
> 	at java.awt.Container.processEvent(Container.java:2085)
> 	at java.awt.Component.dispatchEventImpl(Component.java:4735)
> 	at java.awt.Container.dispatchEventImpl(Container.java:2143)
> 	at java.awt.Component.dispatchEvent(Component.java:4565)
> 	at java.awt.LightweightDispatcher.retargetMouseEvent(Container.java:4621)
> 	at java.awt.LightweightDispatcher.processMouseEvent(Container.java:4282)
> 	at java.awt.LightweightDispatcher.dispatchEvent(Container.java:4212)
> 	at java.awt.Container.dispatchEventImpl(Container.java:2129)
> 	at java.awt.Window.dispatchEventImpl(Window.java:2478)
> 	at java.awt.Component.dispatchEvent(Component.java:4565)
> 	at java.awt.EventQueue.dispatchEventImpl(EventQueue.java:679)
> 	at java.awt.EventQueue.access$000(EventQueue.java:85)
> 	at java.awt.EventQueue$1.run(EventQueue.java:638)
> 	at java.awt.EventQueue$1.run(EventQueue.java:636)
> 	at java.security.AccessController.doPrivileged(Native Method)
> 	at java.security.AccessControlContext$1.doIntersectionPrivilege(AccessControlContext.java:87)
> 	at java.security.AccessControlContext$1.doIntersectionPrivilege(AccessControlContext.java:98)
> 	at java.awt.EventQueue$2.run(EventQueue.java:652)
> 	at java.awt.EventQueue$2.run(EventQueue.java:650)
> 	at java.security.AccessController.doPrivileged(Native Method)
> 	at java.security.AccessControlContext$1.doIntersectionPrivilege(AccessControlContext.java:87)
> 	at java.awt.EventQueue.dispatchEvent(EventQueue.java:649)
> 	at java.awt.EventDispatchThread.pumpOneEventForFilters(EventDispatchThread.java:296)
> 	at java.awt.EventDispatchThread.pumpEventsForFilter(EventDispatchThread.java:211)
> 	at java.awt.EventDispatchThread.pumpEventsForHierarchy(EventDispatchThread.java:201)
> 	at java.awt.EventDispatchThread.pumpEvents(EventDispatchThread.java:196)
> 	at java.awt.EventDispatchThread.pumpEvents(EventDispatchThread.java:188)
> 	at java.awt.EventDispatchThread.run(EventDispatchThread.java:122)
> Caused by: java.lang.NullPointerException
> 	at com.sun.org.apache.xml.internal.serializer.ToHTMLStream.endElement(ToHTMLStream.java:907)
> 	at com.sun.org.apache.xalan.internal.xsltc.trax.TransformerHandlerImpl.endElement(TransformerHandlerImpl.java:273)
> 	at org.apache.tika.sax.ContentHandlerDecorator.endElement(ContentHandlerDecorator.java:136)
> 	at org.apache.tika.gui.TikaGUI$2.endElement(TikaGUI.java:519)
> 	at org.apache.tika.sax.TeeContentHandler.endElement(TeeContentHandler.java:94)
> 	at org.apache.tika.sax.ContentHandlerDecorator.endElement(ContentHandlerDecorator.java:136)
> 	at org.apache.tika.sax.SecureContentHandler.endElement(SecureContentHandler.java:256)
> 	at org.apache.tika.sax.ContentHandlerDecorator.endElement(ContentHandlerDecorator.java:136)
> 	at org.apache.tika.sax.ContentHandlerDecorator.endElement(ContentHandlerDecorator.java:136)
> 	at org.apache.tika.sax.ContentHandlerDecorator.endElement(ContentHandlerDecorator.java:136)
> 	at org.apache.tika.sax.SafeContentHandler.endElement(SafeContentHandler.java:273)
> 	at org.apache.tika.sax.XHTMLContentHandler.endDocument(XHTMLContentHandler.java:213)
> 	at org.apache.tika.parser.microsoft.OfficeParser.parse(OfficeParser.java:159)
> 	at org.apache.tika.parser.CompositeParser.parse(CompositeParser.java:243)
> 	... 44 more
> {code}
> The same file is parsed without any errors when not in GUI mode.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira

        

[jira] [Commented] (TIKA-801) ContentHandlerDecorator outputs invalid element

Posted by "Michael McCandless (Commented) (JIRA)" <ji...@apache.org>.
    [ https://issues.apache.org/jira/browse/TIKA-801?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13165288#comment-13165288 ] 

Michael McCandless commented on TIKA-801:
-----------------------------------------


Actually this isn't a problem of a parser outputting metadata after
startDocument...

The problem, for both of the test docs, is that the Outlook message
has a chunk of RTF text and so OutlookExtractor recurses into the
RTFParser.

RTFParser then calls start/endDocument itself.

I can fix this by having RTFParser expose a separate parse method,
with control over whether or not it should call start/endDocument
itself; that seems to fix these two test docs.

However, if the Outlook message has an HTML chunk, it's also broken:
try running TikaGUI on
tika-parsers/src/test/resources/test-documents/testMSG_chinese.msg
(that's an HTML Outlook message).

How can/should we fix that one?  It's tagsoup that's calling
.endDocument...

                
> ContentHandlerDecorator outputs invalid element
> -----------------------------------------------
>
>                 Key: TIKA-801
>                 URL: https://issues.apache.org/jira/browse/TIKA-801
>             Project: Tika
>          Issue Type: Bug
>    Affects Versions: 1.0, 1.1
>            Reporter: Andrzej Bialecki 
>            Assignee: Michael McCandless
>         Attachments: FW  Testing.msg
>
>
> * Start Tika GUI
> * try opening test-outlook.msg (from tika-parsers test resources)
> * the following exception is thrown:
> {code}
> org.apache.tika.exception.TikaException: Unexpected RuntimeException from org.apache.tika.parser.microsoft.OfficeParser@12e14ebc
> 	at org.apache.tika.parser.CompositeParser.parse(CompositeParser.java:245)
> 	at org.apache.tika.parser.CompositeParser.parse(CompositeParser.java:243)
> 	at org.apache.tika.parser.AutoDetectParser.parse(AutoDetectParser.java:120)
> 	at org.apache.tika.gui.TikaGUI.handleStream(TikaGUI.java:320)
> 	at org.apache.tika.gui.TikaGUI.openFile(TikaGUI.java:279)
> 	at org.apache.tika.gui.TikaGUI.actionPerformed(TikaGUI.java:238)
> 	at javax.swing.AbstractButton.fireActionPerformed(AbstractButton.java:2028)
> 	at javax.swing.AbstractButton$Handler.actionPerformed(AbstractButton.java:2351)
> 	at javax.swing.DefaultButtonModel.fireActionPerformed(DefaultButtonModel.java:387)
> 	at javax.swing.DefaultButtonModel.setPressed(DefaultButtonModel.java:242)
> 	at javax.swing.AbstractButton.doClick(AbstractButton.java:389)
> 	at javax.swing.plaf.basic.BasicMenuItemUI.doClick(BasicMenuItemUI.java:809)
> 	at com.apple.laf.AquaMenuItemUI.doClick(AquaMenuItemUI.java:137)
> 	at javax.swing.plaf.basic.BasicMenuItemUI$Handler.mouseReleased(BasicMenuItemUI.java:850)
> 	at java.awt.Component.processMouseEvent(Component.java:6373)
> 	at javax.swing.JComponent.processMouseEvent(JComponent.java:3267)
> 	at java.awt.Component.processEvent(Component.java:6138)
> 	at java.awt.Container.processEvent(Container.java:2085)
> 	at java.awt.Component.dispatchEventImpl(Component.java:4735)
> 	at java.awt.Container.dispatchEventImpl(Container.java:2143)
> 	at java.awt.Component.dispatchEvent(Component.java:4565)
> 	at java.awt.LightweightDispatcher.retargetMouseEvent(Container.java:4621)
> 	at java.awt.LightweightDispatcher.processMouseEvent(Container.java:4282)
> 	at java.awt.LightweightDispatcher.dispatchEvent(Container.java:4212)
> 	at java.awt.Container.dispatchEventImpl(Container.java:2129)
> 	at java.awt.Window.dispatchEventImpl(Window.java:2478)
> 	at java.awt.Component.dispatchEvent(Component.java:4565)
> 	at java.awt.EventQueue.dispatchEventImpl(EventQueue.java:679)
> 	at java.awt.EventQueue.access$000(EventQueue.java:85)
> 	at java.awt.EventQueue$1.run(EventQueue.java:638)
> 	at java.awt.EventQueue$1.run(EventQueue.java:636)
> 	at java.security.AccessController.doPrivileged(Native Method)
> 	at java.security.AccessControlContext$1.doIntersectionPrivilege(AccessControlContext.java:87)
> 	at java.security.AccessControlContext$1.doIntersectionPrivilege(AccessControlContext.java:98)
> 	at java.awt.EventQueue$2.run(EventQueue.java:652)
> 	at java.awt.EventQueue$2.run(EventQueue.java:650)
> 	at java.security.AccessController.doPrivileged(Native Method)
> 	at java.security.AccessControlContext$1.doIntersectionPrivilege(AccessControlContext.java:87)
> 	at java.awt.EventQueue.dispatchEvent(EventQueue.java:649)
> 	at java.awt.EventDispatchThread.pumpOneEventForFilters(EventDispatchThread.java:296)
> 	at java.awt.EventDispatchThread.pumpEventsForFilter(EventDispatchThread.java:211)
> 	at java.awt.EventDispatchThread.pumpEventsForHierarchy(EventDispatchThread.java:201)
> 	at java.awt.EventDispatchThread.pumpEvents(EventDispatchThread.java:196)
> 	at java.awt.EventDispatchThread.pumpEvents(EventDispatchThread.java:188)
> 	at java.awt.EventDispatchThread.run(EventDispatchThread.java:122)
> Caused by: java.lang.NullPointerException
> 	at com.sun.org.apache.xml.internal.serializer.ToHTMLStream.endElement(ToHTMLStream.java:907)
> 	at com.sun.org.apache.xalan.internal.xsltc.trax.TransformerHandlerImpl.endElement(TransformerHandlerImpl.java:273)
> 	at org.apache.tika.sax.ContentHandlerDecorator.endElement(ContentHandlerDecorator.java:136)
> 	at org.apache.tika.gui.TikaGUI$2.endElement(TikaGUI.java:519)
> 	at org.apache.tika.sax.TeeContentHandler.endElement(TeeContentHandler.java:94)
> 	at org.apache.tika.sax.ContentHandlerDecorator.endElement(ContentHandlerDecorator.java:136)
> 	at org.apache.tika.sax.SecureContentHandler.endElement(SecureContentHandler.java:256)
> 	at org.apache.tika.sax.ContentHandlerDecorator.endElement(ContentHandlerDecorator.java:136)
> 	at org.apache.tika.sax.ContentHandlerDecorator.endElement(ContentHandlerDecorator.java:136)
> 	at org.apache.tika.sax.ContentHandlerDecorator.endElement(ContentHandlerDecorator.java:136)
> 	at org.apache.tika.sax.SafeContentHandler.endElement(SafeContentHandler.java:273)
> 	at org.apache.tika.sax.XHTMLContentHandler.endDocument(XHTMLContentHandler.java:213)
> 	at org.apache.tika.parser.microsoft.OfficeParser.parse(OfficeParser.java:159)
> 	at org.apache.tika.parser.CompositeParser.parse(CompositeParser.java:243)
> 	... 44 more
> {code}
> The same file is parsed without any errors when not in GUI mode.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira

        

[jira] [Commented] (TIKA-801) ContentHandlerDecorator outputs invalid element

Posted by "Paul Hill (Commented) (JIRA)" <ji...@apache.org>.
    [ https://issues.apache.org/jira/browse/TIKA-801?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13165579#comment-13165579 ] 

Paul Hill commented on TIKA-801:
--------------------------------

Thanks Jukka and Michael. Your quick response is appreciated.
                
> ContentHandlerDecorator outputs invalid element
> -----------------------------------------------
>
>                 Key: TIKA-801
>                 URL: https://issues.apache.org/jira/browse/TIKA-801
>             Project: Tika
>          Issue Type: Bug
>    Affects Versions: 1.0, 1.1
>            Reporter: Andrzej Bialecki 
>            Assignee: Michael McCandless
>         Attachments: FW  Testing.msg, TIKA-801.patch
>
>
> * Start Tika GUI
> * try opening test-outlook.msg (from tika-parsers test resources)
> * the following exception is thrown:
> {code}
> org.apache.tika.exception.TikaException: Unexpected RuntimeException from org.apache.tika.parser.microsoft.OfficeParser@12e14ebc
> 	at org.apache.tika.parser.CompositeParser.parse(CompositeParser.java:245)
> 	at org.apache.tika.parser.CompositeParser.parse(CompositeParser.java:243)
> 	at org.apache.tika.parser.AutoDetectParser.parse(AutoDetectParser.java:120)
> 	at org.apache.tika.gui.TikaGUI.handleStream(TikaGUI.java:320)
> 	at org.apache.tika.gui.TikaGUI.openFile(TikaGUI.java:279)
> 	at org.apache.tika.gui.TikaGUI.actionPerformed(TikaGUI.java:238)
> 	at javax.swing.AbstractButton.fireActionPerformed(AbstractButton.java:2028)
> 	at javax.swing.AbstractButton$Handler.actionPerformed(AbstractButton.java:2351)
> 	at javax.swing.DefaultButtonModel.fireActionPerformed(DefaultButtonModel.java:387)
> 	at javax.swing.DefaultButtonModel.setPressed(DefaultButtonModel.java:242)
> 	at javax.swing.AbstractButton.doClick(AbstractButton.java:389)
> 	at javax.swing.plaf.basic.BasicMenuItemUI.doClick(BasicMenuItemUI.java:809)
> 	at com.apple.laf.AquaMenuItemUI.doClick(AquaMenuItemUI.java:137)
> 	at javax.swing.plaf.basic.BasicMenuItemUI$Handler.mouseReleased(BasicMenuItemUI.java:850)
> 	at java.awt.Component.processMouseEvent(Component.java:6373)
> 	at javax.swing.JComponent.processMouseEvent(JComponent.java:3267)
> 	at java.awt.Component.processEvent(Component.java:6138)
> 	at java.awt.Container.processEvent(Container.java:2085)
> 	at java.awt.Component.dispatchEventImpl(Component.java:4735)
> 	at java.awt.Container.dispatchEventImpl(Container.java:2143)
> 	at java.awt.Component.dispatchEvent(Component.java:4565)
> 	at java.awt.LightweightDispatcher.retargetMouseEvent(Container.java:4621)
> 	at java.awt.LightweightDispatcher.processMouseEvent(Container.java:4282)
> 	at java.awt.LightweightDispatcher.dispatchEvent(Container.java:4212)
> 	at java.awt.Container.dispatchEventImpl(Container.java:2129)
> 	at java.awt.Window.dispatchEventImpl(Window.java:2478)
> 	at java.awt.Component.dispatchEvent(Component.java:4565)
> 	at java.awt.EventQueue.dispatchEventImpl(EventQueue.java:679)
> 	at java.awt.EventQueue.access$000(EventQueue.java:85)
> 	at java.awt.EventQueue$1.run(EventQueue.java:638)
> 	at java.awt.EventQueue$1.run(EventQueue.java:636)
> 	at java.security.AccessController.doPrivileged(Native Method)
> 	at java.security.AccessControlContext$1.doIntersectionPrivilege(AccessControlContext.java:87)
> 	at java.security.AccessControlContext$1.doIntersectionPrivilege(AccessControlContext.java:98)
> 	at java.awt.EventQueue$2.run(EventQueue.java:652)
> 	at java.awt.EventQueue$2.run(EventQueue.java:650)
> 	at java.security.AccessController.doPrivileged(Native Method)
> 	at java.security.AccessControlContext$1.doIntersectionPrivilege(AccessControlContext.java:87)
> 	at java.awt.EventQueue.dispatchEvent(EventQueue.java:649)
> 	at java.awt.EventDispatchThread.pumpOneEventForFilters(EventDispatchThread.java:296)
> 	at java.awt.EventDispatchThread.pumpEventsForFilter(EventDispatchThread.java:211)
> 	at java.awt.EventDispatchThread.pumpEventsForHierarchy(EventDispatchThread.java:201)
> 	at java.awt.EventDispatchThread.pumpEvents(EventDispatchThread.java:196)
> 	at java.awt.EventDispatchThread.pumpEvents(EventDispatchThread.java:188)
> 	at java.awt.EventDispatchThread.run(EventDispatchThread.java:122)
> Caused by: java.lang.NullPointerException
> 	at com.sun.org.apache.xml.internal.serializer.ToHTMLStream.endElement(ToHTMLStream.java:907)
> 	at com.sun.org.apache.xalan.internal.xsltc.trax.TransformerHandlerImpl.endElement(TransformerHandlerImpl.java:273)
> 	at org.apache.tika.sax.ContentHandlerDecorator.endElement(ContentHandlerDecorator.java:136)
> 	at org.apache.tika.gui.TikaGUI$2.endElement(TikaGUI.java:519)
> 	at org.apache.tika.sax.TeeContentHandler.endElement(TeeContentHandler.java:94)
> 	at org.apache.tika.sax.ContentHandlerDecorator.endElement(ContentHandlerDecorator.java:136)
> 	at org.apache.tika.sax.SecureContentHandler.endElement(SecureContentHandler.java:256)
> 	at org.apache.tika.sax.ContentHandlerDecorator.endElement(ContentHandlerDecorator.java:136)
> 	at org.apache.tika.sax.ContentHandlerDecorator.endElement(ContentHandlerDecorator.java:136)
> 	at org.apache.tika.sax.ContentHandlerDecorator.endElement(ContentHandlerDecorator.java:136)
> 	at org.apache.tika.sax.SafeContentHandler.endElement(SafeContentHandler.java:273)
> 	at org.apache.tika.sax.XHTMLContentHandler.endDocument(XHTMLContentHandler.java:213)
> 	at org.apache.tika.parser.microsoft.OfficeParser.parse(OfficeParser.java:159)
> 	at org.apache.tika.parser.CompositeParser.parse(CompositeParser.java:243)
> 	... 44 more
> {code}
> The same file is parsed without any errors when not in GUI mode.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira

        

[jira] [Resolved] (TIKA-801) ContentHandlerDecorator outputs invalid element

Posted by "Michael McCandless (Resolved) (JIRA)" <ji...@apache.org>.
     [ https://issues.apache.org/jira/browse/TIKA-801?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Michael McCandless resolved TIKA-801.
-------------------------------------

       Resolution: Fixed
    Fix Version/s: 1.1
    
> ContentHandlerDecorator outputs invalid element
> -----------------------------------------------
>
>                 Key: TIKA-801
>                 URL: https://issues.apache.org/jira/browse/TIKA-801
>             Project: Tika
>          Issue Type: Bug
>    Affects Versions: 1.0, 1.1
>            Reporter: Andrzej Bialecki 
>            Assignee: Michael McCandless
>             Fix For: 1.1
>
>         Attachments: FW  Testing.msg, TIKA-801.patch
>
>
> * Start Tika GUI
> * try opening test-outlook.msg (from tika-parsers test resources)
> * the following exception is thrown:
> {code}
> org.apache.tika.exception.TikaException: Unexpected RuntimeException from org.apache.tika.parser.microsoft.OfficeParser@12e14ebc
> 	at org.apache.tika.parser.CompositeParser.parse(CompositeParser.java:245)
> 	at org.apache.tika.parser.CompositeParser.parse(CompositeParser.java:243)
> 	at org.apache.tika.parser.AutoDetectParser.parse(AutoDetectParser.java:120)
> 	at org.apache.tika.gui.TikaGUI.handleStream(TikaGUI.java:320)
> 	at org.apache.tika.gui.TikaGUI.openFile(TikaGUI.java:279)
> 	at org.apache.tika.gui.TikaGUI.actionPerformed(TikaGUI.java:238)
> 	at javax.swing.AbstractButton.fireActionPerformed(AbstractButton.java:2028)
> 	at javax.swing.AbstractButton$Handler.actionPerformed(AbstractButton.java:2351)
> 	at javax.swing.DefaultButtonModel.fireActionPerformed(DefaultButtonModel.java:387)
> 	at javax.swing.DefaultButtonModel.setPressed(DefaultButtonModel.java:242)
> 	at javax.swing.AbstractButton.doClick(AbstractButton.java:389)
> 	at javax.swing.plaf.basic.BasicMenuItemUI.doClick(BasicMenuItemUI.java:809)
> 	at com.apple.laf.AquaMenuItemUI.doClick(AquaMenuItemUI.java:137)
> 	at javax.swing.plaf.basic.BasicMenuItemUI$Handler.mouseReleased(BasicMenuItemUI.java:850)
> 	at java.awt.Component.processMouseEvent(Component.java:6373)
> 	at javax.swing.JComponent.processMouseEvent(JComponent.java:3267)
> 	at java.awt.Component.processEvent(Component.java:6138)
> 	at java.awt.Container.processEvent(Container.java:2085)
> 	at java.awt.Component.dispatchEventImpl(Component.java:4735)
> 	at java.awt.Container.dispatchEventImpl(Container.java:2143)
> 	at java.awt.Component.dispatchEvent(Component.java:4565)
> 	at java.awt.LightweightDispatcher.retargetMouseEvent(Container.java:4621)
> 	at java.awt.LightweightDispatcher.processMouseEvent(Container.java:4282)
> 	at java.awt.LightweightDispatcher.dispatchEvent(Container.java:4212)
> 	at java.awt.Container.dispatchEventImpl(Container.java:2129)
> 	at java.awt.Window.dispatchEventImpl(Window.java:2478)
> 	at java.awt.Component.dispatchEvent(Component.java:4565)
> 	at java.awt.EventQueue.dispatchEventImpl(EventQueue.java:679)
> 	at java.awt.EventQueue.access$000(EventQueue.java:85)
> 	at java.awt.EventQueue$1.run(EventQueue.java:638)
> 	at java.awt.EventQueue$1.run(EventQueue.java:636)
> 	at java.security.AccessController.doPrivileged(Native Method)
> 	at java.security.AccessControlContext$1.doIntersectionPrivilege(AccessControlContext.java:87)
> 	at java.security.AccessControlContext$1.doIntersectionPrivilege(AccessControlContext.java:98)
> 	at java.awt.EventQueue$2.run(EventQueue.java:652)
> 	at java.awt.EventQueue$2.run(EventQueue.java:650)
> 	at java.security.AccessController.doPrivileged(Native Method)
> 	at java.security.AccessControlContext$1.doIntersectionPrivilege(AccessControlContext.java:87)
> 	at java.awt.EventQueue.dispatchEvent(EventQueue.java:649)
> 	at java.awt.EventDispatchThread.pumpOneEventForFilters(EventDispatchThread.java:296)
> 	at java.awt.EventDispatchThread.pumpEventsForFilter(EventDispatchThread.java:211)
> 	at java.awt.EventDispatchThread.pumpEventsForHierarchy(EventDispatchThread.java:201)
> 	at java.awt.EventDispatchThread.pumpEvents(EventDispatchThread.java:196)
> 	at java.awt.EventDispatchThread.pumpEvents(EventDispatchThread.java:188)
> 	at java.awt.EventDispatchThread.run(EventDispatchThread.java:122)
> Caused by: java.lang.NullPointerException
> 	at com.sun.org.apache.xml.internal.serializer.ToHTMLStream.endElement(ToHTMLStream.java:907)
> 	at com.sun.org.apache.xalan.internal.xsltc.trax.TransformerHandlerImpl.endElement(TransformerHandlerImpl.java:273)
> 	at org.apache.tika.sax.ContentHandlerDecorator.endElement(ContentHandlerDecorator.java:136)
> 	at org.apache.tika.gui.TikaGUI$2.endElement(TikaGUI.java:519)
> 	at org.apache.tika.sax.TeeContentHandler.endElement(TeeContentHandler.java:94)
> 	at org.apache.tika.sax.ContentHandlerDecorator.endElement(ContentHandlerDecorator.java:136)
> 	at org.apache.tika.sax.SecureContentHandler.endElement(SecureContentHandler.java:256)
> 	at org.apache.tika.sax.ContentHandlerDecorator.endElement(ContentHandlerDecorator.java:136)
> 	at org.apache.tika.sax.ContentHandlerDecorator.endElement(ContentHandlerDecorator.java:136)
> 	at org.apache.tika.sax.ContentHandlerDecorator.endElement(ContentHandlerDecorator.java:136)
> 	at org.apache.tika.sax.SafeContentHandler.endElement(SafeContentHandler.java:273)
> 	at org.apache.tika.sax.XHTMLContentHandler.endDocument(XHTMLContentHandler.java:213)
> 	at org.apache.tika.parser.microsoft.OfficeParser.parse(OfficeParser.java:159)
> 	at org.apache.tika.parser.CompositeParser.parse(CompositeParser.java:243)
> 	... 44 more
> {code}
> The same file is parsed without any errors when not in GUI mode.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira

        

[jira] [Commented] (TIKA-801) ContentHandlerDecorator outputs invalid element

Posted by "Michael McCandless (Commented) (JIRA)" <ji...@apache.org>.
    [ https://issues.apache.org/jira/browse/TIKA-801?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13162781#comment-13162781 ] 

Michael McCandless commented on TIKA-801:
-----------------------------------------

This is happening because the OfficeParser is producing a 2nd body endElement (ie </body>) without a matching body (<body>) startElement.

TIKA-715 would have caught this earlier... when I enable the asserts from there, and run TikaCLI to extract text from this doc, indeed I hit:

{noformat}
Exception in thread "main" java.lang.AssertionError: end tag=body with no startElement
	at org.apache.tika.sax.SafeContentHandler.verifyEndElement(SafeContentHandler.java:219)
	at org.apache.tika.sax.SafeContentHandler.endElement(SafeContentHandler.java:270)
	at org.apache.tika.sax.XHTMLContentHandler.endDocument(XHTMLContentHandler.java:213)
	at org.apache.tika.parser.microsoft.OfficeParser.parse(OfficeParser.java:159)
	at org.apache.tika.parser.CompositeParser.parse(CompositeParser.java:242)
	at org.apache.tika.parser.CompositeParser.parse(CompositeParser.java:242)
	at org.apache.tika.parser.AutoDetectParser.parse(AutoDetectParser.java:120)
	at org.apache.tika.cli.TikaCLI$OutputType.process(TikaCLI.java:130)
	at org.apache.tika.cli.TikaCLI.process(TikaCLI.java:397)
	at org.apache.tika.cli.TikaCLI.main(TikaCLI.java:101)
{noformat}

(There's also a 2nd mismatched </html> tag).

But... I don't know why OfficeParser is producing a mismatched </body></html> for this document!

Maybe, it's invoking a sub-parser but failing to wrap the ContentHandler with EndDocumentShieldingContentHandler?  (OpenDocumentParser uses EndDocumentShieldingContentHandler for this same reason...).

                
> ContentHandlerDecorator outputs invalid element
> -----------------------------------------------
>
>                 Key: TIKA-801
>                 URL: https://issues.apache.org/jira/browse/TIKA-801
>             Project: Tika
>          Issue Type: Bug
>    Affects Versions: 1.0, 1.1
>            Reporter: Andrzej Bialecki 
>
> * Start Tika GUI
> * try opening test-outlook.msg (from tika-parsers test resources)
> * the following exception is thrown:
> {code}
> org.apache.tika.exception.TikaException: Unexpected RuntimeException from org.apache.tika.parser.microsoft.OfficeParser@12e14ebc
> 	at org.apache.tika.parser.CompositeParser.parse(CompositeParser.java:245)
> 	at org.apache.tika.parser.CompositeParser.parse(CompositeParser.java:243)
> 	at org.apache.tika.parser.AutoDetectParser.parse(AutoDetectParser.java:120)
> 	at org.apache.tika.gui.TikaGUI.handleStream(TikaGUI.java:320)
> 	at org.apache.tika.gui.TikaGUI.openFile(TikaGUI.java:279)
> 	at org.apache.tika.gui.TikaGUI.actionPerformed(TikaGUI.java:238)
> 	at javax.swing.AbstractButton.fireActionPerformed(AbstractButton.java:2028)
> 	at javax.swing.AbstractButton$Handler.actionPerformed(AbstractButton.java:2351)
> 	at javax.swing.DefaultButtonModel.fireActionPerformed(DefaultButtonModel.java:387)
> 	at javax.swing.DefaultButtonModel.setPressed(DefaultButtonModel.java:242)
> 	at javax.swing.AbstractButton.doClick(AbstractButton.java:389)
> 	at javax.swing.plaf.basic.BasicMenuItemUI.doClick(BasicMenuItemUI.java:809)
> 	at com.apple.laf.AquaMenuItemUI.doClick(AquaMenuItemUI.java:137)
> 	at javax.swing.plaf.basic.BasicMenuItemUI$Handler.mouseReleased(BasicMenuItemUI.java:850)
> 	at java.awt.Component.processMouseEvent(Component.java:6373)
> 	at javax.swing.JComponent.processMouseEvent(JComponent.java:3267)
> 	at java.awt.Component.processEvent(Component.java:6138)
> 	at java.awt.Container.processEvent(Container.java:2085)
> 	at java.awt.Component.dispatchEventImpl(Component.java:4735)
> 	at java.awt.Container.dispatchEventImpl(Container.java:2143)
> 	at java.awt.Component.dispatchEvent(Component.java:4565)
> 	at java.awt.LightweightDispatcher.retargetMouseEvent(Container.java:4621)
> 	at java.awt.LightweightDispatcher.processMouseEvent(Container.java:4282)
> 	at java.awt.LightweightDispatcher.dispatchEvent(Container.java:4212)
> 	at java.awt.Container.dispatchEventImpl(Container.java:2129)
> 	at java.awt.Window.dispatchEventImpl(Window.java:2478)
> 	at java.awt.Component.dispatchEvent(Component.java:4565)
> 	at java.awt.EventQueue.dispatchEventImpl(EventQueue.java:679)
> 	at java.awt.EventQueue.access$000(EventQueue.java:85)
> 	at java.awt.EventQueue$1.run(EventQueue.java:638)
> 	at java.awt.EventQueue$1.run(EventQueue.java:636)
> 	at java.security.AccessController.doPrivileged(Native Method)
> 	at java.security.AccessControlContext$1.doIntersectionPrivilege(AccessControlContext.java:87)
> 	at java.security.AccessControlContext$1.doIntersectionPrivilege(AccessControlContext.java:98)
> 	at java.awt.EventQueue$2.run(EventQueue.java:652)
> 	at java.awt.EventQueue$2.run(EventQueue.java:650)
> 	at java.security.AccessController.doPrivileged(Native Method)
> 	at java.security.AccessControlContext$1.doIntersectionPrivilege(AccessControlContext.java:87)
> 	at java.awt.EventQueue.dispatchEvent(EventQueue.java:649)
> 	at java.awt.EventDispatchThread.pumpOneEventForFilters(EventDispatchThread.java:296)
> 	at java.awt.EventDispatchThread.pumpEventsForFilter(EventDispatchThread.java:211)
> 	at java.awt.EventDispatchThread.pumpEventsForHierarchy(EventDispatchThread.java:201)
> 	at java.awt.EventDispatchThread.pumpEvents(EventDispatchThread.java:196)
> 	at java.awt.EventDispatchThread.pumpEvents(EventDispatchThread.java:188)
> 	at java.awt.EventDispatchThread.run(EventDispatchThread.java:122)
> Caused by: java.lang.NullPointerException
> 	at com.sun.org.apache.xml.internal.serializer.ToHTMLStream.endElement(ToHTMLStream.java:907)
> 	at com.sun.org.apache.xalan.internal.xsltc.trax.TransformerHandlerImpl.endElement(TransformerHandlerImpl.java:273)
> 	at org.apache.tika.sax.ContentHandlerDecorator.endElement(ContentHandlerDecorator.java:136)
> 	at org.apache.tika.gui.TikaGUI$2.endElement(TikaGUI.java:519)
> 	at org.apache.tika.sax.TeeContentHandler.endElement(TeeContentHandler.java:94)
> 	at org.apache.tika.sax.ContentHandlerDecorator.endElement(ContentHandlerDecorator.java:136)
> 	at org.apache.tika.sax.SecureContentHandler.endElement(SecureContentHandler.java:256)
> 	at org.apache.tika.sax.ContentHandlerDecorator.endElement(ContentHandlerDecorator.java:136)
> 	at org.apache.tika.sax.ContentHandlerDecorator.endElement(ContentHandlerDecorator.java:136)
> 	at org.apache.tika.sax.ContentHandlerDecorator.endElement(ContentHandlerDecorator.java:136)
> 	at org.apache.tika.sax.SafeContentHandler.endElement(SafeContentHandler.java:273)
> 	at org.apache.tika.sax.XHTMLContentHandler.endDocument(XHTMLContentHandler.java:213)
> 	at org.apache.tika.parser.microsoft.OfficeParser.parse(OfficeParser.java:159)
> 	at org.apache.tika.parser.CompositeParser.parse(CompositeParser.java:243)
> 	... 44 more
> {code}
> The same file is parsed without any errors when not in GUI mode.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira

        

[jira] [Commented] (TIKA-801) ContentHandlerDecorator outputs invalid element

Posted by "Jukka Zitting (Commented) (JIRA)" <ji...@apache.org>.
    [ https://issues.apache.org/jira/browse/TIKA-801?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13162795#comment-13162795 ] 

Jukka Zitting commented on TIKA-801:
------------------------------------

bq. EndDocumentShieldingContentHandler

IMHO we shouldn't be using the EDSCH mechanism. As noted by Nick in TIKA-646, the correct fix for cases like this would be to update the parsers to generate the metadata *before* they call endDocument. The EDSCH solution only fixes the symptoms but not the root cause of the problem.
                
> ContentHandlerDecorator outputs invalid element
> -----------------------------------------------
>
>                 Key: TIKA-801
>                 URL: https://issues.apache.org/jira/browse/TIKA-801
>             Project: Tika
>          Issue Type: Bug
>    Affects Versions: 1.0, 1.1
>            Reporter: Andrzej Bialecki 
>
> * Start Tika GUI
> * try opening test-outlook.msg (from tika-parsers test resources)
> * the following exception is thrown:
> {code}
> org.apache.tika.exception.TikaException: Unexpected RuntimeException from org.apache.tika.parser.microsoft.OfficeParser@12e14ebc
> 	at org.apache.tika.parser.CompositeParser.parse(CompositeParser.java:245)
> 	at org.apache.tika.parser.CompositeParser.parse(CompositeParser.java:243)
> 	at org.apache.tika.parser.AutoDetectParser.parse(AutoDetectParser.java:120)
> 	at org.apache.tika.gui.TikaGUI.handleStream(TikaGUI.java:320)
> 	at org.apache.tika.gui.TikaGUI.openFile(TikaGUI.java:279)
> 	at org.apache.tika.gui.TikaGUI.actionPerformed(TikaGUI.java:238)
> 	at javax.swing.AbstractButton.fireActionPerformed(AbstractButton.java:2028)
> 	at javax.swing.AbstractButton$Handler.actionPerformed(AbstractButton.java:2351)
> 	at javax.swing.DefaultButtonModel.fireActionPerformed(DefaultButtonModel.java:387)
> 	at javax.swing.DefaultButtonModel.setPressed(DefaultButtonModel.java:242)
> 	at javax.swing.AbstractButton.doClick(AbstractButton.java:389)
> 	at javax.swing.plaf.basic.BasicMenuItemUI.doClick(BasicMenuItemUI.java:809)
> 	at com.apple.laf.AquaMenuItemUI.doClick(AquaMenuItemUI.java:137)
> 	at javax.swing.plaf.basic.BasicMenuItemUI$Handler.mouseReleased(BasicMenuItemUI.java:850)
> 	at java.awt.Component.processMouseEvent(Component.java:6373)
> 	at javax.swing.JComponent.processMouseEvent(JComponent.java:3267)
> 	at java.awt.Component.processEvent(Component.java:6138)
> 	at java.awt.Container.processEvent(Container.java:2085)
> 	at java.awt.Component.dispatchEventImpl(Component.java:4735)
> 	at java.awt.Container.dispatchEventImpl(Container.java:2143)
> 	at java.awt.Component.dispatchEvent(Component.java:4565)
> 	at java.awt.LightweightDispatcher.retargetMouseEvent(Container.java:4621)
> 	at java.awt.LightweightDispatcher.processMouseEvent(Container.java:4282)
> 	at java.awt.LightweightDispatcher.dispatchEvent(Container.java:4212)
> 	at java.awt.Container.dispatchEventImpl(Container.java:2129)
> 	at java.awt.Window.dispatchEventImpl(Window.java:2478)
> 	at java.awt.Component.dispatchEvent(Component.java:4565)
> 	at java.awt.EventQueue.dispatchEventImpl(EventQueue.java:679)
> 	at java.awt.EventQueue.access$000(EventQueue.java:85)
> 	at java.awt.EventQueue$1.run(EventQueue.java:638)
> 	at java.awt.EventQueue$1.run(EventQueue.java:636)
> 	at java.security.AccessController.doPrivileged(Native Method)
> 	at java.security.AccessControlContext$1.doIntersectionPrivilege(AccessControlContext.java:87)
> 	at java.security.AccessControlContext$1.doIntersectionPrivilege(AccessControlContext.java:98)
> 	at java.awt.EventQueue$2.run(EventQueue.java:652)
> 	at java.awt.EventQueue$2.run(EventQueue.java:650)
> 	at java.security.AccessController.doPrivileged(Native Method)
> 	at java.security.AccessControlContext$1.doIntersectionPrivilege(AccessControlContext.java:87)
> 	at java.awt.EventQueue.dispatchEvent(EventQueue.java:649)
> 	at java.awt.EventDispatchThread.pumpOneEventForFilters(EventDispatchThread.java:296)
> 	at java.awt.EventDispatchThread.pumpEventsForFilter(EventDispatchThread.java:211)
> 	at java.awt.EventDispatchThread.pumpEventsForHierarchy(EventDispatchThread.java:201)
> 	at java.awt.EventDispatchThread.pumpEvents(EventDispatchThread.java:196)
> 	at java.awt.EventDispatchThread.pumpEvents(EventDispatchThread.java:188)
> 	at java.awt.EventDispatchThread.run(EventDispatchThread.java:122)
> Caused by: java.lang.NullPointerException
> 	at com.sun.org.apache.xml.internal.serializer.ToHTMLStream.endElement(ToHTMLStream.java:907)
> 	at com.sun.org.apache.xalan.internal.xsltc.trax.TransformerHandlerImpl.endElement(TransformerHandlerImpl.java:273)
> 	at org.apache.tika.sax.ContentHandlerDecorator.endElement(ContentHandlerDecorator.java:136)
> 	at org.apache.tika.gui.TikaGUI$2.endElement(TikaGUI.java:519)
> 	at org.apache.tika.sax.TeeContentHandler.endElement(TeeContentHandler.java:94)
> 	at org.apache.tika.sax.ContentHandlerDecorator.endElement(ContentHandlerDecorator.java:136)
> 	at org.apache.tika.sax.SecureContentHandler.endElement(SecureContentHandler.java:256)
> 	at org.apache.tika.sax.ContentHandlerDecorator.endElement(ContentHandlerDecorator.java:136)
> 	at org.apache.tika.sax.ContentHandlerDecorator.endElement(ContentHandlerDecorator.java:136)
> 	at org.apache.tika.sax.ContentHandlerDecorator.endElement(ContentHandlerDecorator.java:136)
> 	at org.apache.tika.sax.SafeContentHandler.endElement(SafeContentHandler.java:273)
> 	at org.apache.tika.sax.XHTMLContentHandler.endDocument(XHTMLContentHandler.java:213)
> 	at org.apache.tika.parser.microsoft.OfficeParser.parse(OfficeParser.java:159)
> 	at org.apache.tika.parser.CompositeParser.parse(CompositeParser.java:243)
> 	... 44 more
> {code}
> The same file is parsed without any errors when not in GUI mode.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira

        

[jira] [Issue Comment Edited] (TIKA-801) ContentHandlerDecorator outputs invalid element

Posted by "Paul Hill (Issue Comment Edited) (JIRA)" <ji...@apache.org>.
    [ https://issues.apache.org/jira/browse/TIKA-801?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13164879#comment-13164879 ] 

Paul Hill edited comment on TIKA-801 at 12/8/11 12:53 AM:
----------------------------------------------------------

The attached file stack dumps in Tika 1.0 as of 2011-12-05 Tika-app-1.0 Release (not the nightly build).
                
      was (Author: pahill):
    The attached file stack dumps in Tika 1.0 as of 2011-12-05 Tika-app-1.0 Release
                  
> ContentHandlerDecorator outputs invalid element
> -----------------------------------------------
>
>                 Key: TIKA-801
>                 URL: https://issues.apache.org/jira/browse/TIKA-801
>             Project: Tika
>          Issue Type: Bug
>    Affects Versions: 1.0, 1.1
>            Reporter: Andrzej Bialecki 
>         Attachments: FW  Testing.msg
>
>
> * Start Tika GUI
> * try opening test-outlook.msg (from tika-parsers test resources)
> * the following exception is thrown:
> {code}
> org.apache.tika.exception.TikaException: Unexpected RuntimeException from org.apache.tika.parser.microsoft.OfficeParser@12e14ebc
> 	at org.apache.tika.parser.CompositeParser.parse(CompositeParser.java:245)
> 	at org.apache.tika.parser.CompositeParser.parse(CompositeParser.java:243)
> 	at org.apache.tika.parser.AutoDetectParser.parse(AutoDetectParser.java:120)
> 	at org.apache.tika.gui.TikaGUI.handleStream(TikaGUI.java:320)
> 	at org.apache.tika.gui.TikaGUI.openFile(TikaGUI.java:279)
> 	at org.apache.tika.gui.TikaGUI.actionPerformed(TikaGUI.java:238)
> 	at javax.swing.AbstractButton.fireActionPerformed(AbstractButton.java:2028)
> 	at javax.swing.AbstractButton$Handler.actionPerformed(AbstractButton.java:2351)
> 	at javax.swing.DefaultButtonModel.fireActionPerformed(DefaultButtonModel.java:387)
> 	at javax.swing.DefaultButtonModel.setPressed(DefaultButtonModel.java:242)
> 	at javax.swing.AbstractButton.doClick(AbstractButton.java:389)
> 	at javax.swing.plaf.basic.BasicMenuItemUI.doClick(BasicMenuItemUI.java:809)
> 	at com.apple.laf.AquaMenuItemUI.doClick(AquaMenuItemUI.java:137)
> 	at javax.swing.plaf.basic.BasicMenuItemUI$Handler.mouseReleased(BasicMenuItemUI.java:850)
> 	at java.awt.Component.processMouseEvent(Component.java:6373)
> 	at javax.swing.JComponent.processMouseEvent(JComponent.java:3267)
> 	at java.awt.Component.processEvent(Component.java:6138)
> 	at java.awt.Container.processEvent(Container.java:2085)
> 	at java.awt.Component.dispatchEventImpl(Component.java:4735)
> 	at java.awt.Container.dispatchEventImpl(Container.java:2143)
> 	at java.awt.Component.dispatchEvent(Component.java:4565)
> 	at java.awt.LightweightDispatcher.retargetMouseEvent(Container.java:4621)
> 	at java.awt.LightweightDispatcher.processMouseEvent(Container.java:4282)
> 	at java.awt.LightweightDispatcher.dispatchEvent(Container.java:4212)
> 	at java.awt.Container.dispatchEventImpl(Container.java:2129)
> 	at java.awt.Window.dispatchEventImpl(Window.java:2478)
> 	at java.awt.Component.dispatchEvent(Component.java:4565)
> 	at java.awt.EventQueue.dispatchEventImpl(EventQueue.java:679)
> 	at java.awt.EventQueue.access$000(EventQueue.java:85)
> 	at java.awt.EventQueue$1.run(EventQueue.java:638)
> 	at java.awt.EventQueue$1.run(EventQueue.java:636)
> 	at java.security.AccessController.doPrivileged(Native Method)
> 	at java.security.AccessControlContext$1.doIntersectionPrivilege(AccessControlContext.java:87)
> 	at java.security.AccessControlContext$1.doIntersectionPrivilege(AccessControlContext.java:98)
> 	at java.awt.EventQueue$2.run(EventQueue.java:652)
> 	at java.awt.EventQueue$2.run(EventQueue.java:650)
> 	at java.security.AccessController.doPrivileged(Native Method)
> 	at java.security.AccessControlContext$1.doIntersectionPrivilege(AccessControlContext.java:87)
> 	at java.awt.EventQueue.dispatchEvent(EventQueue.java:649)
> 	at java.awt.EventDispatchThread.pumpOneEventForFilters(EventDispatchThread.java:296)
> 	at java.awt.EventDispatchThread.pumpEventsForFilter(EventDispatchThread.java:211)
> 	at java.awt.EventDispatchThread.pumpEventsForHierarchy(EventDispatchThread.java:201)
> 	at java.awt.EventDispatchThread.pumpEvents(EventDispatchThread.java:196)
> 	at java.awt.EventDispatchThread.pumpEvents(EventDispatchThread.java:188)
> 	at java.awt.EventDispatchThread.run(EventDispatchThread.java:122)
> Caused by: java.lang.NullPointerException
> 	at com.sun.org.apache.xml.internal.serializer.ToHTMLStream.endElement(ToHTMLStream.java:907)
> 	at com.sun.org.apache.xalan.internal.xsltc.trax.TransformerHandlerImpl.endElement(TransformerHandlerImpl.java:273)
> 	at org.apache.tika.sax.ContentHandlerDecorator.endElement(ContentHandlerDecorator.java:136)
> 	at org.apache.tika.gui.TikaGUI$2.endElement(TikaGUI.java:519)
> 	at org.apache.tika.sax.TeeContentHandler.endElement(TeeContentHandler.java:94)
> 	at org.apache.tika.sax.ContentHandlerDecorator.endElement(ContentHandlerDecorator.java:136)
> 	at org.apache.tika.sax.SecureContentHandler.endElement(SecureContentHandler.java:256)
> 	at org.apache.tika.sax.ContentHandlerDecorator.endElement(ContentHandlerDecorator.java:136)
> 	at org.apache.tika.sax.ContentHandlerDecorator.endElement(ContentHandlerDecorator.java:136)
> 	at org.apache.tika.sax.ContentHandlerDecorator.endElement(ContentHandlerDecorator.java:136)
> 	at org.apache.tika.sax.SafeContentHandler.endElement(SafeContentHandler.java:273)
> 	at org.apache.tika.sax.XHTMLContentHandler.endDocument(XHTMLContentHandler.java:213)
> 	at org.apache.tika.parser.microsoft.OfficeParser.parse(OfficeParser.java:159)
> 	at org.apache.tika.parser.CompositeParser.parse(CompositeParser.java:243)
> 	... 44 more
> {code}
> The same file is parsed without any errors when not in GUI mode.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira

        

[jira] [Commented] (TIKA-801) ContentHandlerDecorator outputs invalid element

Posted by "Jukka Zitting (Commented) (JIRA)" <ji...@apache.org>.
    [ https://issues.apache.org/jira/browse/TIKA-801?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13165291#comment-13165291 ] 

Jukka Zitting commented on TIKA-801:
------------------------------------

See the org.apache.tika.sax.EmbeddedContentHandler class. It's explicitly designed for cases like this.
The ParsingEmbeddedDocumentExtractor class has an example of how to use EmbeddedContentHandler.
                
> ContentHandlerDecorator outputs invalid element
> -----------------------------------------------
>
>                 Key: TIKA-801
>                 URL: https://issues.apache.org/jira/browse/TIKA-801
>             Project: Tika
>          Issue Type: Bug
>    Affects Versions: 1.0, 1.1
>            Reporter: Andrzej Bialecki 
>            Assignee: Michael McCandless
>         Attachments: FW  Testing.msg
>
>
> * Start Tika GUI
> * try opening test-outlook.msg (from tika-parsers test resources)
> * the following exception is thrown:
> {code}
> org.apache.tika.exception.TikaException: Unexpected RuntimeException from org.apache.tika.parser.microsoft.OfficeParser@12e14ebc
> 	at org.apache.tika.parser.CompositeParser.parse(CompositeParser.java:245)
> 	at org.apache.tika.parser.CompositeParser.parse(CompositeParser.java:243)
> 	at org.apache.tika.parser.AutoDetectParser.parse(AutoDetectParser.java:120)
> 	at org.apache.tika.gui.TikaGUI.handleStream(TikaGUI.java:320)
> 	at org.apache.tika.gui.TikaGUI.openFile(TikaGUI.java:279)
> 	at org.apache.tika.gui.TikaGUI.actionPerformed(TikaGUI.java:238)
> 	at javax.swing.AbstractButton.fireActionPerformed(AbstractButton.java:2028)
> 	at javax.swing.AbstractButton$Handler.actionPerformed(AbstractButton.java:2351)
> 	at javax.swing.DefaultButtonModel.fireActionPerformed(DefaultButtonModel.java:387)
> 	at javax.swing.DefaultButtonModel.setPressed(DefaultButtonModel.java:242)
> 	at javax.swing.AbstractButton.doClick(AbstractButton.java:389)
> 	at javax.swing.plaf.basic.BasicMenuItemUI.doClick(BasicMenuItemUI.java:809)
> 	at com.apple.laf.AquaMenuItemUI.doClick(AquaMenuItemUI.java:137)
> 	at javax.swing.plaf.basic.BasicMenuItemUI$Handler.mouseReleased(BasicMenuItemUI.java:850)
> 	at java.awt.Component.processMouseEvent(Component.java:6373)
> 	at javax.swing.JComponent.processMouseEvent(JComponent.java:3267)
> 	at java.awt.Component.processEvent(Component.java:6138)
> 	at java.awt.Container.processEvent(Container.java:2085)
> 	at java.awt.Component.dispatchEventImpl(Component.java:4735)
> 	at java.awt.Container.dispatchEventImpl(Container.java:2143)
> 	at java.awt.Component.dispatchEvent(Component.java:4565)
> 	at java.awt.LightweightDispatcher.retargetMouseEvent(Container.java:4621)
> 	at java.awt.LightweightDispatcher.processMouseEvent(Container.java:4282)
> 	at java.awt.LightweightDispatcher.dispatchEvent(Container.java:4212)
> 	at java.awt.Container.dispatchEventImpl(Container.java:2129)
> 	at java.awt.Window.dispatchEventImpl(Window.java:2478)
> 	at java.awt.Component.dispatchEvent(Component.java:4565)
> 	at java.awt.EventQueue.dispatchEventImpl(EventQueue.java:679)
> 	at java.awt.EventQueue.access$000(EventQueue.java:85)
> 	at java.awt.EventQueue$1.run(EventQueue.java:638)
> 	at java.awt.EventQueue$1.run(EventQueue.java:636)
> 	at java.security.AccessController.doPrivileged(Native Method)
> 	at java.security.AccessControlContext$1.doIntersectionPrivilege(AccessControlContext.java:87)
> 	at java.security.AccessControlContext$1.doIntersectionPrivilege(AccessControlContext.java:98)
> 	at java.awt.EventQueue$2.run(EventQueue.java:652)
> 	at java.awt.EventQueue$2.run(EventQueue.java:650)
> 	at java.security.AccessController.doPrivileged(Native Method)
> 	at java.security.AccessControlContext$1.doIntersectionPrivilege(AccessControlContext.java:87)
> 	at java.awt.EventQueue.dispatchEvent(EventQueue.java:649)
> 	at java.awt.EventDispatchThread.pumpOneEventForFilters(EventDispatchThread.java:296)
> 	at java.awt.EventDispatchThread.pumpEventsForFilter(EventDispatchThread.java:211)
> 	at java.awt.EventDispatchThread.pumpEventsForHierarchy(EventDispatchThread.java:201)
> 	at java.awt.EventDispatchThread.pumpEvents(EventDispatchThread.java:196)
> 	at java.awt.EventDispatchThread.pumpEvents(EventDispatchThread.java:188)
> 	at java.awt.EventDispatchThread.run(EventDispatchThread.java:122)
> Caused by: java.lang.NullPointerException
> 	at com.sun.org.apache.xml.internal.serializer.ToHTMLStream.endElement(ToHTMLStream.java:907)
> 	at com.sun.org.apache.xalan.internal.xsltc.trax.TransformerHandlerImpl.endElement(TransformerHandlerImpl.java:273)
> 	at org.apache.tika.sax.ContentHandlerDecorator.endElement(ContentHandlerDecorator.java:136)
> 	at org.apache.tika.gui.TikaGUI$2.endElement(TikaGUI.java:519)
> 	at org.apache.tika.sax.TeeContentHandler.endElement(TeeContentHandler.java:94)
> 	at org.apache.tika.sax.ContentHandlerDecorator.endElement(ContentHandlerDecorator.java:136)
> 	at org.apache.tika.sax.SecureContentHandler.endElement(SecureContentHandler.java:256)
> 	at org.apache.tika.sax.ContentHandlerDecorator.endElement(ContentHandlerDecorator.java:136)
> 	at org.apache.tika.sax.ContentHandlerDecorator.endElement(ContentHandlerDecorator.java:136)
> 	at org.apache.tika.sax.ContentHandlerDecorator.endElement(ContentHandlerDecorator.java:136)
> 	at org.apache.tika.sax.SafeContentHandler.endElement(SafeContentHandler.java:273)
> 	at org.apache.tika.sax.XHTMLContentHandler.endDocument(XHTMLContentHandler.java:213)
> 	at org.apache.tika.parser.microsoft.OfficeParser.parse(OfficeParser.java:159)
> 	at org.apache.tika.parser.CompositeParser.parse(CompositeParser.java:243)
> 	... 44 more
> {code}
> The same file is parsed without any errors when not in GUI mode.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira

        

[jira] [Commented] (TIKA-801) ContentHandlerDecorator outputs invalid element

Posted by "Jukka Zitting (Commented) (JIRA)" <ji...@apache.org>.
    [ https://issues.apache.org/jira/browse/TIKA-801?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13166148#comment-13166148 ] 

Jukka Zitting commented on TIKA-801:
------------------------------------

bq. patch attached

Looks good, +1.
                
> ContentHandlerDecorator outputs invalid element
> -----------------------------------------------
>
>                 Key: TIKA-801
>                 URL: https://issues.apache.org/jira/browse/TIKA-801
>             Project: Tika
>          Issue Type: Bug
>    Affects Versions: 1.0, 1.1
>            Reporter: Andrzej Bialecki 
>            Assignee: Michael McCandless
>         Attachments: FW  Testing.msg, TIKA-801.patch
>
>
> * Start Tika GUI
> * try opening test-outlook.msg (from tika-parsers test resources)
> * the following exception is thrown:
> {code}
> org.apache.tika.exception.TikaException: Unexpected RuntimeException from org.apache.tika.parser.microsoft.OfficeParser@12e14ebc
> 	at org.apache.tika.parser.CompositeParser.parse(CompositeParser.java:245)
> 	at org.apache.tika.parser.CompositeParser.parse(CompositeParser.java:243)
> 	at org.apache.tika.parser.AutoDetectParser.parse(AutoDetectParser.java:120)
> 	at org.apache.tika.gui.TikaGUI.handleStream(TikaGUI.java:320)
> 	at org.apache.tika.gui.TikaGUI.openFile(TikaGUI.java:279)
> 	at org.apache.tika.gui.TikaGUI.actionPerformed(TikaGUI.java:238)
> 	at javax.swing.AbstractButton.fireActionPerformed(AbstractButton.java:2028)
> 	at javax.swing.AbstractButton$Handler.actionPerformed(AbstractButton.java:2351)
> 	at javax.swing.DefaultButtonModel.fireActionPerformed(DefaultButtonModel.java:387)
> 	at javax.swing.DefaultButtonModel.setPressed(DefaultButtonModel.java:242)
> 	at javax.swing.AbstractButton.doClick(AbstractButton.java:389)
> 	at javax.swing.plaf.basic.BasicMenuItemUI.doClick(BasicMenuItemUI.java:809)
> 	at com.apple.laf.AquaMenuItemUI.doClick(AquaMenuItemUI.java:137)
> 	at javax.swing.plaf.basic.BasicMenuItemUI$Handler.mouseReleased(BasicMenuItemUI.java:850)
> 	at java.awt.Component.processMouseEvent(Component.java:6373)
> 	at javax.swing.JComponent.processMouseEvent(JComponent.java:3267)
> 	at java.awt.Component.processEvent(Component.java:6138)
> 	at java.awt.Container.processEvent(Container.java:2085)
> 	at java.awt.Component.dispatchEventImpl(Component.java:4735)
> 	at java.awt.Container.dispatchEventImpl(Container.java:2143)
> 	at java.awt.Component.dispatchEvent(Component.java:4565)
> 	at java.awt.LightweightDispatcher.retargetMouseEvent(Container.java:4621)
> 	at java.awt.LightweightDispatcher.processMouseEvent(Container.java:4282)
> 	at java.awt.LightweightDispatcher.dispatchEvent(Container.java:4212)
> 	at java.awt.Container.dispatchEventImpl(Container.java:2129)
> 	at java.awt.Window.dispatchEventImpl(Window.java:2478)
> 	at java.awt.Component.dispatchEvent(Component.java:4565)
> 	at java.awt.EventQueue.dispatchEventImpl(EventQueue.java:679)
> 	at java.awt.EventQueue.access$000(EventQueue.java:85)
> 	at java.awt.EventQueue$1.run(EventQueue.java:638)
> 	at java.awt.EventQueue$1.run(EventQueue.java:636)
> 	at java.security.AccessController.doPrivileged(Native Method)
> 	at java.security.AccessControlContext$1.doIntersectionPrivilege(AccessControlContext.java:87)
> 	at java.security.AccessControlContext$1.doIntersectionPrivilege(AccessControlContext.java:98)
> 	at java.awt.EventQueue$2.run(EventQueue.java:652)
> 	at java.awt.EventQueue$2.run(EventQueue.java:650)
> 	at java.security.AccessController.doPrivileged(Native Method)
> 	at java.security.AccessControlContext$1.doIntersectionPrivilege(AccessControlContext.java:87)
> 	at java.awt.EventQueue.dispatchEvent(EventQueue.java:649)
> 	at java.awt.EventDispatchThread.pumpOneEventForFilters(EventDispatchThread.java:296)
> 	at java.awt.EventDispatchThread.pumpEventsForFilter(EventDispatchThread.java:211)
> 	at java.awt.EventDispatchThread.pumpEventsForHierarchy(EventDispatchThread.java:201)
> 	at java.awt.EventDispatchThread.pumpEvents(EventDispatchThread.java:196)
> 	at java.awt.EventDispatchThread.pumpEvents(EventDispatchThread.java:188)
> 	at java.awt.EventDispatchThread.run(EventDispatchThread.java:122)
> Caused by: java.lang.NullPointerException
> 	at com.sun.org.apache.xml.internal.serializer.ToHTMLStream.endElement(ToHTMLStream.java:907)
> 	at com.sun.org.apache.xalan.internal.xsltc.trax.TransformerHandlerImpl.endElement(TransformerHandlerImpl.java:273)
> 	at org.apache.tika.sax.ContentHandlerDecorator.endElement(ContentHandlerDecorator.java:136)
> 	at org.apache.tika.gui.TikaGUI$2.endElement(TikaGUI.java:519)
> 	at org.apache.tika.sax.TeeContentHandler.endElement(TeeContentHandler.java:94)
> 	at org.apache.tika.sax.ContentHandlerDecorator.endElement(ContentHandlerDecorator.java:136)
> 	at org.apache.tika.sax.SecureContentHandler.endElement(SecureContentHandler.java:256)
> 	at org.apache.tika.sax.ContentHandlerDecorator.endElement(ContentHandlerDecorator.java:136)
> 	at org.apache.tika.sax.ContentHandlerDecorator.endElement(ContentHandlerDecorator.java:136)
> 	at org.apache.tika.sax.ContentHandlerDecorator.endElement(ContentHandlerDecorator.java:136)
> 	at org.apache.tika.sax.SafeContentHandler.endElement(SafeContentHandler.java:273)
> 	at org.apache.tika.sax.XHTMLContentHandler.endDocument(XHTMLContentHandler.java:213)
> 	at org.apache.tika.parser.microsoft.OfficeParser.parse(OfficeParser.java:159)
> 	at org.apache.tika.parser.CompositeParser.parse(CompositeParser.java:243)
> 	... 44 more
> {code}
> The same file is parsed without any errors when not in GUI mode.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira

        

[jira] [Updated] (TIKA-801) ContentHandlerDecorator outputs invalid element

Posted by "Paul Hill (Updated) (JIRA)" <ji...@apache.org>.
     [ https://issues.apache.org/jira/browse/TIKA-801?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Paul Hill updated TIKA-801:
---------------------------

    Attachment: FW  Testing.msg

This files stack dumps in Tika 1.0 as of 2011-12-05 Tika-app-1.0 Release
                
> ContentHandlerDecorator outputs invalid element
> -----------------------------------------------
>
>                 Key: TIKA-801
>                 URL: https://issues.apache.org/jira/browse/TIKA-801
>             Project: Tika
>          Issue Type: Bug
>    Affects Versions: 1.0, 1.1
>            Reporter: Andrzej Bialecki 
>         Attachments: FW  Testing.msg
>
>
> * Start Tika GUI
> * try opening test-outlook.msg (from tika-parsers test resources)
> * the following exception is thrown:
> {code}
> org.apache.tika.exception.TikaException: Unexpected RuntimeException from org.apache.tika.parser.microsoft.OfficeParser@12e14ebc
> 	at org.apache.tika.parser.CompositeParser.parse(CompositeParser.java:245)
> 	at org.apache.tika.parser.CompositeParser.parse(CompositeParser.java:243)
> 	at org.apache.tika.parser.AutoDetectParser.parse(AutoDetectParser.java:120)
> 	at org.apache.tika.gui.TikaGUI.handleStream(TikaGUI.java:320)
> 	at org.apache.tika.gui.TikaGUI.openFile(TikaGUI.java:279)
> 	at org.apache.tika.gui.TikaGUI.actionPerformed(TikaGUI.java:238)
> 	at javax.swing.AbstractButton.fireActionPerformed(AbstractButton.java:2028)
> 	at javax.swing.AbstractButton$Handler.actionPerformed(AbstractButton.java:2351)
> 	at javax.swing.DefaultButtonModel.fireActionPerformed(DefaultButtonModel.java:387)
> 	at javax.swing.DefaultButtonModel.setPressed(DefaultButtonModel.java:242)
> 	at javax.swing.AbstractButton.doClick(AbstractButton.java:389)
> 	at javax.swing.plaf.basic.BasicMenuItemUI.doClick(BasicMenuItemUI.java:809)
> 	at com.apple.laf.AquaMenuItemUI.doClick(AquaMenuItemUI.java:137)
> 	at javax.swing.plaf.basic.BasicMenuItemUI$Handler.mouseReleased(BasicMenuItemUI.java:850)
> 	at java.awt.Component.processMouseEvent(Component.java:6373)
> 	at javax.swing.JComponent.processMouseEvent(JComponent.java:3267)
> 	at java.awt.Component.processEvent(Component.java:6138)
> 	at java.awt.Container.processEvent(Container.java:2085)
> 	at java.awt.Component.dispatchEventImpl(Component.java:4735)
> 	at java.awt.Container.dispatchEventImpl(Container.java:2143)
> 	at java.awt.Component.dispatchEvent(Component.java:4565)
> 	at java.awt.LightweightDispatcher.retargetMouseEvent(Container.java:4621)
> 	at java.awt.LightweightDispatcher.processMouseEvent(Container.java:4282)
> 	at java.awt.LightweightDispatcher.dispatchEvent(Container.java:4212)
> 	at java.awt.Container.dispatchEventImpl(Container.java:2129)
> 	at java.awt.Window.dispatchEventImpl(Window.java:2478)
> 	at java.awt.Component.dispatchEvent(Component.java:4565)
> 	at java.awt.EventQueue.dispatchEventImpl(EventQueue.java:679)
> 	at java.awt.EventQueue.access$000(EventQueue.java:85)
> 	at java.awt.EventQueue$1.run(EventQueue.java:638)
> 	at java.awt.EventQueue$1.run(EventQueue.java:636)
> 	at java.security.AccessController.doPrivileged(Native Method)
> 	at java.security.AccessControlContext$1.doIntersectionPrivilege(AccessControlContext.java:87)
> 	at java.security.AccessControlContext$1.doIntersectionPrivilege(AccessControlContext.java:98)
> 	at java.awt.EventQueue$2.run(EventQueue.java:652)
> 	at java.awt.EventQueue$2.run(EventQueue.java:650)
> 	at java.security.AccessController.doPrivileged(Native Method)
> 	at java.security.AccessControlContext$1.doIntersectionPrivilege(AccessControlContext.java:87)
> 	at java.awt.EventQueue.dispatchEvent(EventQueue.java:649)
> 	at java.awt.EventDispatchThread.pumpOneEventForFilters(EventDispatchThread.java:296)
> 	at java.awt.EventDispatchThread.pumpEventsForFilter(EventDispatchThread.java:211)
> 	at java.awt.EventDispatchThread.pumpEventsForHierarchy(EventDispatchThread.java:201)
> 	at java.awt.EventDispatchThread.pumpEvents(EventDispatchThread.java:196)
> 	at java.awt.EventDispatchThread.pumpEvents(EventDispatchThread.java:188)
> 	at java.awt.EventDispatchThread.run(EventDispatchThread.java:122)
> Caused by: java.lang.NullPointerException
> 	at com.sun.org.apache.xml.internal.serializer.ToHTMLStream.endElement(ToHTMLStream.java:907)
> 	at com.sun.org.apache.xalan.internal.xsltc.trax.TransformerHandlerImpl.endElement(TransformerHandlerImpl.java:273)
> 	at org.apache.tika.sax.ContentHandlerDecorator.endElement(ContentHandlerDecorator.java:136)
> 	at org.apache.tika.gui.TikaGUI$2.endElement(TikaGUI.java:519)
> 	at org.apache.tika.sax.TeeContentHandler.endElement(TeeContentHandler.java:94)
> 	at org.apache.tika.sax.ContentHandlerDecorator.endElement(ContentHandlerDecorator.java:136)
> 	at org.apache.tika.sax.SecureContentHandler.endElement(SecureContentHandler.java:256)
> 	at org.apache.tika.sax.ContentHandlerDecorator.endElement(ContentHandlerDecorator.java:136)
> 	at org.apache.tika.sax.ContentHandlerDecorator.endElement(ContentHandlerDecorator.java:136)
> 	at org.apache.tika.sax.ContentHandlerDecorator.endElement(ContentHandlerDecorator.java:136)
> 	at org.apache.tika.sax.SafeContentHandler.endElement(SafeContentHandler.java:273)
> 	at org.apache.tika.sax.XHTMLContentHandler.endDocument(XHTMLContentHandler.java:213)
> 	at org.apache.tika.parser.microsoft.OfficeParser.parse(OfficeParser.java:159)
> 	at org.apache.tika.parser.CompositeParser.parse(CompositeParser.java:243)
> 	... 44 more
> {code}
> The same file is parsed without any errors when not in GUI mode.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira

        

[jira] [Updated] (TIKA-801) ContentHandlerDecorator outputs invalid element

Posted by "Michael McCandless (Updated) (JIRA)" <ji...@apache.org>.
     [ https://issues.apache.org/jira/browse/TIKA-801?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Michael McCandless updated TIKA-801:
------------------------------------

    Attachment: TIKA-801.patch

bq. See the org.apache.tika.sax.EmbeddedContentHandler class. 

Excellent!

I did that (patch attached) and these RTF/HTML Outlook docs are now fine through TikaGUI.
                
> ContentHandlerDecorator outputs invalid element
> -----------------------------------------------
>
>                 Key: TIKA-801
>                 URL: https://issues.apache.org/jira/browse/TIKA-801
>             Project: Tika
>          Issue Type: Bug
>    Affects Versions: 1.0, 1.1
>            Reporter: Andrzej Bialecki 
>            Assignee: Michael McCandless
>         Attachments: FW  Testing.msg, TIKA-801.patch
>
>
> * Start Tika GUI
> * try opening test-outlook.msg (from tika-parsers test resources)
> * the following exception is thrown:
> {code}
> org.apache.tika.exception.TikaException: Unexpected RuntimeException from org.apache.tika.parser.microsoft.OfficeParser@12e14ebc
> 	at org.apache.tika.parser.CompositeParser.parse(CompositeParser.java:245)
> 	at org.apache.tika.parser.CompositeParser.parse(CompositeParser.java:243)
> 	at org.apache.tika.parser.AutoDetectParser.parse(AutoDetectParser.java:120)
> 	at org.apache.tika.gui.TikaGUI.handleStream(TikaGUI.java:320)
> 	at org.apache.tika.gui.TikaGUI.openFile(TikaGUI.java:279)
> 	at org.apache.tika.gui.TikaGUI.actionPerformed(TikaGUI.java:238)
> 	at javax.swing.AbstractButton.fireActionPerformed(AbstractButton.java:2028)
> 	at javax.swing.AbstractButton$Handler.actionPerformed(AbstractButton.java:2351)
> 	at javax.swing.DefaultButtonModel.fireActionPerformed(DefaultButtonModel.java:387)
> 	at javax.swing.DefaultButtonModel.setPressed(DefaultButtonModel.java:242)
> 	at javax.swing.AbstractButton.doClick(AbstractButton.java:389)
> 	at javax.swing.plaf.basic.BasicMenuItemUI.doClick(BasicMenuItemUI.java:809)
> 	at com.apple.laf.AquaMenuItemUI.doClick(AquaMenuItemUI.java:137)
> 	at javax.swing.plaf.basic.BasicMenuItemUI$Handler.mouseReleased(BasicMenuItemUI.java:850)
> 	at java.awt.Component.processMouseEvent(Component.java:6373)
> 	at javax.swing.JComponent.processMouseEvent(JComponent.java:3267)
> 	at java.awt.Component.processEvent(Component.java:6138)
> 	at java.awt.Container.processEvent(Container.java:2085)
> 	at java.awt.Component.dispatchEventImpl(Component.java:4735)
> 	at java.awt.Container.dispatchEventImpl(Container.java:2143)
> 	at java.awt.Component.dispatchEvent(Component.java:4565)
> 	at java.awt.LightweightDispatcher.retargetMouseEvent(Container.java:4621)
> 	at java.awt.LightweightDispatcher.processMouseEvent(Container.java:4282)
> 	at java.awt.LightweightDispatcher.dispatchEvent(Container.java:4212)
> 	at java.awt.Container.dispatchEventImpl(Container.java:2129)
> 	at java.awt.Window.dispatchEventImpl(Window.java:2478)
> 	at java.awt.Component.dispatchEvent(Component.java:4565)
> 	at java.awt.EventQueue.dispatchEventImpl(EventQueue.java:679)
> 	at java.awt.EventQueue.access$000(EventQueue.java:85)
> 	at java.awt.EventQueue$1.run(EventQueue.java:638)
> 	at java.awt.EventQueue$1.run(EventQueue.java:636)
> 	at java.security.AccessController.doPrivileged(Native Method)
> 	at java.security.AccessControlContext$1.doIntersectionPrivilege(AccessControlContext.java:87)
> 	at java.security.AccessControlContext$1.doIntersectionPrivilege(AccessControlContext.java:98)
> 	at java.awt.EventQueue$2.run(EventQueue.java:652)
> 	at java.awt.EventQueue$2.run(EventQueue.java:650)
> 	at java.security.AccessController.doPrivileged(Native Method)
> 	at java.security.AccessControlContext$1.doIntersectionPrivilege(AccessControlContext.java:87)
> 	at java.awt.EventQueue.dispatchEvent(EventQueue.java:649)
> 	at java.awt.EventDispatchThread.pumpOneEventForFilters(EventDispatchThread.java:296)
> 	at java.awt.EventDispatchThread.pumpEventsForFilter(EventDispatchThread.java:211)
> 	at java.awt.EventDispatchThread.pumpEventsForHierarchy(EventDispatchThread.java:201)
> 	at java.awt.EventDispatchThread.pumpEvents(EventDispatchThread.java:196)
> 	at java.awt.EventDispatchThread.pumpEvents(EventDispatchThread.java:188)
> 	at java.awt.EventDispatchThread.run(EventDispatchThread.java:122)
> Caused by: java.lang.NullPointerException
> 	at com.sun.org.apache.xml.internal.serializer.ToHTMLStream.endElement(ToHTMLStream.java:907)
> 	at com.sun.org.apache.xalan.internal.xsltc.trax.TransformerHandlerImpl.endElement(TransformerHandlerImpl.java:273)
> 	at org.apache.tika.sax.ContentHandlerDecorator.endElement(ContentHandlerDecorator.java:136)
> 	at org.apache.tika.gui.TikaGUI$2.endElement(TikaGUI.java:519)
> 	at org.apache.tika.sax.TeeContentHandler.endElement(TeeContentHandler.java:94)
> 	at org.apache.tika.sax.ContentHandlerDecorator.endElement(ContentHandlerDecorator.java:136)
> 	at org.apache.tika.sax.SecureContentHandler.endElement(SecureContentHandler.java:256)
> 	at org.apache.tika.sax.ContentHandlerDecorator.endElement(ContentHandlerDecorator.java:136)
> 	at org.apache.tika.sax.ContentHandlerDecorator.endElement(ContentHandlerDecorator.java:136)
> 	at org.apache.tika.sax.ContentHandlerDecorator.endElement(ContentHandlerDecorator.java:136)
> 	at org.apache.tika.sax.SafeContentHandler.endElement(SafeContentHandler.java:273)
> 	at org.apache.tika.sax.XHTMLContentHandler.endDocument(XHTMLContentHandler.java:213)
> 	at org.apache.tika.parser.microsoft.OfficeParser.parse(OfficeParser.java:159)
> 	at org.apache.tika.parser.CompositeParser.parse(CompositeParser.java:243)
> 	... 44 more
> {code}
> The same file is parsed without any errors when not in GUI mode.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira