Posted to dev@tika.apache.org by "John Mastarone (Created) (JIRA)" <ji...@apache.org> on 2011/12/12 03:56:31 UTC

[jira] [Created] (TIKA-809) IndexOutOfBoundsException with TikaGUI

IndexOutOfBoundsException with TikaGUI
--------------------------------------

                 Key: TIKA-809
                 URL: https://issues.apache.org/jira/browse/TIKA-809
             Project: Tika
          Issue Type: Bug
          Components: gui
    Affects Versions: 1.1
         Environment: Windows 7
            Reporter: John Mastarone


From issue TIKA-410: the test document attached to that issue, "test.doc", causes an IndexOutOfBoundsException when dropped into the latest build of the tika-app GUI.  The cause is that the suggestFileExtension() method of org.apache.poi.hwpf.usermodel.Picture, which Tika's WordExtractor class uses, can return an empty string.  The requestSave method of ImageSavingParser (an inner class of TikaGUI) is not written to handle an image that has no extension.
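
The requestSave source is not quoted in this thread, so the following is only an illustration of how an extension-less name can produce this class of exception. It assumes the suffix is derived with substring(lastIndexOf('.')); the class and method names (MissingExtensionSketch, naiveSuffix) are hypothetical and are not taken from TikaGUI.

```java
final class MissingExtensionSketch {

    /** Hypothetical suffix lookup for illustration only; NOT the actual TikaGUI code. */
    static String naiveSuffix(String embeddedName) {
        // lastIndexOf returns -1 when the name contains no '.',
        // and substring(-1) throws StringIndexOutOfBoundsException.
        return embeddedName.substring(embeddedName.lastIndexOf('.'));
    }

    public static void main(String[] args) {
        System.out.println(naiveSuffix("image1.png")); // prints ".png"
        try {
            naiveSuffix("image1"); // no extension at all
        } catch (IndexOutOfBoundsException e) {
            System.out.println("failed: " + e.getClass().getSimpleName());
        }
    }
}
```

StringIndexOutOfBoundsException is a subclass of IndexOutOfBoundsException, so a lookup of this shape would be consistent with the reported symptom, though the real failure path may differ.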

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira

        

[jira] [Commented] (TIKA-809) IndexOutOfBoundsException with TikaGUI

Posted by "Nick Burch (Commented) (JIRA)" <ji...@apache.org>.
    [ https://issues.apache.org/jira/browse/TIKA-809?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13167324#comment-13167324 ] 

Nick Burch commented on TIKA-809:
---------------------------------

This should improve once we move to POI 3.8 beta 5, which has better image detection logic. It still needs fixing on our side though, so I will look at it shortly.
                

        

[jira] [Commented] (TIKA-809) IndexOutOfBoundsException with TikaGUI

Posted by "Nick Burch (Commented) (JIRA)" <ji...@apache.org>.
    [ https://issues.apache.org/jira/browse/TIKA-809?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13167325#comment-13167325 ] 

Nick Burch commented on TIKA-809:
---------------------------------

Fixed in r1213136.
                

        

[jira] [Resolved] (TIKA-809) IndexOutOfBoundsException with TikaGUI

Posted by "Nick Burch (Resolved) (JIRA)" <ji...@apache.org>.
     [ https://issues.apache.org/jira/browse/TIKA-809?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Nick Burch resolved TIKA-809.
-----------------------------

       Resolution: Fixed
    Fix Version/s: 1.1
    

        

[jira] [Updated] (TIKA-809) IndexOutOfBoundsException with TikaGUI

Posted by "John Mastarone (Updated) (JIRA)" <ji...@apache.org>.
     [ https://issues.apache.org/jira/browse/TIKA-809?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

John Mastarone updated TIKA-809:
--------------------------------

    Attachment: test.doc
                TIKA-809.patch

Patch submitted to resolve the issue.  When the embedded image's extension is empty, the patch passes null as the suffix to the File.createTempFile method, which then falls back to its default suffix, ".tmp".  That suffix does not conflict with any extension in POI's PictureType enum, in case this matters.
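
The attached patch is not reproduced in this thread, so what follows is a minimal sketch of the approach described above rather than the patch itself. The class, the method names (suffixFor, createFor) and the "tika-embedded-" prefix are assumptions made for illustration. The one behaviour relied on is documented by the JDK: File.createTempFile uses ".tmp" when the suffix argument is null.

```java
import java.io.File;
import java.io.IOException;
import java.io.UncheckedIOException;

final class TempSuffixSketch {

    /** Returns the suffix to pass to File.createTempFile, or null when no extension is known. */
    static String suffixFor(String extension) {
        if (extension == null || extension.isEmpty()) {
            return null; // a null suffix makes createTempFile use ".tmp"
        }
        return "." + extension;
    }

    /** Creates a temporary file for an embedded image with the given (possibly empty) extension. */
    static File createFor(String extension) {
        try {
            // The prefix is a placeholder; createTempFile requires at least three characters.
            File tmp = File.createTempFile("tika-embedded-", suffixFor(extension));
            tmp.deleteOnExit();
            return tmp;
        } catch (IOException e) {
            throw new UncheckedIOException(e);
        }
    }

    public static void main(String[] args) {
        System.out.println(createFor("").getName());    // name ends with ".tmp"
        System.out.println(createFor("png").getName()); // name ends with ".png"
    }
}
```

Returning null instead of an empty string keeps the fallback inside the JDK call, so no separate default suffix needs to be maintained.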
                