You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@tika.apache.org by "Jukka Zitting (Resolved) (JIRA)" <ji...@apache.org> on 2011/10/07 11:12:29 UTC

[jira] [Resolved] (TIKA-685) Unexpected RuntimeException from org.apache.tika.parser.microsoft.OfficeParser@1a8402c

     [ https://issues.apache.org/jira/browse/TIKA-685?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Jukka Zitting resolved TIKA-685.
--------------------------------

    Resolution: Duplicate

Works with latest Tika, so resolving as a duplicate of some of the other recent POI enhancements.
                
> Unexpected RuntimeException from org.apache.tika.parser.microsoft.OfficeParser@1a8402c
> --------------------------------------------------------------------------------------
>
>                 Key: TIKA-685
>                 URL: https://issues.apache.org/jira/browse/TIKA-685
>             Project: Tika
>          Issue Type: Bug
>          Components: parser
>    Affects Versions: 0.9
>         Environment: MS Windows XP Professional Version 2002 Service Pack 3
>            Reporter: Jaroslaw Krzeminski
>         Attachments: 1.doc
>
>
> Runtime error while parsing MS Word document with Apache Tika GUI App or from a program snippet like:
> InputStream inputStream = new FileInputStream(docFile);
> ContentHandler contentHandler = new BodyContentHandler(new BufferedWriter(new FileWriter(textFile)));
> Metadata metadata = new Metadata();
> AutoDetectParser parser = new AutoDetectParser();
> parser.parse(inputStream, contentHandler, metadata);
> Error from Tika App Errors panel:
> org.apache.tika.exception.TikaException: Unexpected RuntimeException from org.apache.tika.parser.microsoft.OfficeParser@1a8402c
> 	at org.apache.tika.parser.CompositeParser.parse(CompositeParser.java:199)
> 	at org.apache.tika.parser.CompositeParser.parse(CompositeParser.java:197)
> 	at org.apache.tika.parser.AutoDetectParser.parse(AutoDetectParser.java:135)
> 	at org.apache.tika.gui.TikaGUI.importStream(TikaGUI.java:186)
> 	at org.apache.tika.gui.ParsingTransferHandler.importData(ParsingTransferHandler.java:99)
> 	at javax.swing.TransferHandler.importData(Unknown Source)
> 	at javax.swing.TransferHandler$DropHandler.drop(Unknown Source)
> 	at java.awt.dnd.DropTarget.drop(Unknown Source)
> 	at javax.swing.TransferHandler$SwingDropTarget.drop(Unknown Source)
> 	at sun.awt.dnd.SunDropTargetContextPeer.processDropMessage(Unknown Source)
> 	at sun.awt.dnd.SunDropTargetContextPeer$EventDispatcher.dispatchDropEvent(Unknown Source)
> 	at sun.awt.dnd.SunDropTargetContextPeer$EventDispatcher.dispatchEvent(Unknown Source)
> 	at sun.awt.dnd.SunDropTargetEvent.dispatch(Unknown Source)
> 	at java.awt.Component.dispatchEventImpl(Unknown Source)
> 	at java.awt.Container.dispatchEventImpl(Unknown Source)
> 	at java.awt.Component.dispatchEvent(Unknown Source)
> 	at java.awt.LightweightDispatcher.retargetMouseEvent(Unknown Source)
> 	at java.awt.LightweightDispatcher.processDropTargetEvent(Unknown Source)
> 	at java.awt.LightweightDispatcher.dispatchEvent(Unknown Source)
> 	at java.awt.Container.dispatchEventImpl(Unknown Source)
> 	at java.awt.Window.dispatchEventImpl(Unknown Source)
> 	at java.awt.Component.dispatchEvent(Unknown Source)
> 	at java.awt.EventQueue.dispatchEventImpl(Unknown Source)
> 	at java.awt.EventQueue.access$000(Unknown Source)
> 	at java.awt.EventQueue$1.run(Unknown Source)
> 	at java.awt.EventQueue$1.run(Unknown Source)
> 	at java.security.AccessController.doPrivileged(Native Method)
> 	at java.security.AccessControlContext$1.doIntersectionPrivilege(Unknown Source)
> 	at java.security.AccessControlContext$1.doIntersectionPrivilege(Unknown Source)
> 	at java.awt.EventQueue$2.run(Unknown Source)
> 	at java.awt.EventQueue$2.run(Unknown Source)
> 	at java.security.AccessController.doPrivileged(Native Method)
> 	at java.security.AccessControlContext$1.doIntersectionPrivilege(Unknown Source)
> 	at java.awt.EventQueue.dispatchEvent(Unknown Source)
> 	at java.awt.EventDispatchThread.pumpOneEventForFilters(Unknown Source)
> 	at java.awt.EventDispatchThread.pumpEventsForFilter(Unknown Source)
> 	at java.awt.EventDispatchThread.pumpEventsForHierarchy(Unknown Source)
> 	at java.awt.EventDispatchThread.pumpEvents(Unknown Source)
> 	at java.awt.EventDispatchThread.pumpEvents(Unknown Source)
> 	at java.awt.EventDispatchThread.run(Unknown Source)
> Caused by: java.lang.NullPointerException
> 	at org.apache.poi.hwpf.sprm.CharacterSprmUncompressor.uncompressCHP(CharacterSprmUncompressor.java:39)
> 	at org.apache.poi.hwpf.model.CHPX.getCharacterProperties(CHPX.java:61)
> 	at org.apache.poi.hwpf.usermodel.CharacterRun.<init>(CharacterRun.java:98)
> 	at org.apache.poi.hwpf.usermodel.Range.getCharacterRun(Range.java:797)
> 	at org.apache.poi.hwpf.model.PicturesTable.getAllPictures(PicturesTable.java:191)
> 	at org.apache.tika.parser.microsoft.WordExtractor$PicturesSource.<init>(WordExtractor.java:430)
> 	at org.apache.tika.parser.microsoft.WordExtractor$PicturesSource.<init>(WordExtractor.java:420)
> 	at org.apache.tika.parser.microsoft.WordExtractor.parse(WordExtractor.java:75)
> 	at org.apache.tika.parser.microsoft.OfficeParser.parse(OfficeParser.java:182)
> 	at org.apache.tika.parser.CompositeParser.parse(CompositeParser.java:197)
> 	... 39 more
>  

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira