You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@tika.apache.org by "Tim Allison (JIRA)" <ji...@apache.org> on 2016/10/25 16:38:59 UTC

[jira] [Commented] (TIKA-2143) POI deprecated method used in TIKA 1.13

    [ https://issues.apache.org/jira/browse/TIKA-2143?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15605797#comment-15605797 ] 

Tim Allison commented on TIKA-2143:
-----------------------------------

Hi [~sbathrutheen], thank you for raising this.  From the stacktrace, it looks like the problem is initiated by Tika calling <init>...I don't think this is a problem with Tika calling a deprecated method.  Is there a chance you have a version of POI on your classpath that is older than 3.15-beta1?  The line numbers in the stacktrace don't line up at all with POI 3.15-beta1's [HSLFSlideShow|http://svn.apache.org/viewvc/poi/tags/REL_3_15_BETA1/src/scratchpad/src/org/apache/poi/hslf/usermodel/HSLFSlideShow.java?view=markup#l188] 

> POI deprecated method used in TIKA 1.13 
> ----------------------------------------
>
>                 Key: TIKA-2143
>                 URL: https://issues.apache.org/jira/browse/TIKA-2143
>             Project: Tika
>          Issue Type: Bug
>          Components: parser
>    Affects Versions: 1.9, 1.13
>         Environment: Windows java application
>            Reporter: sbathrutheen
>            Priority: Trivial
>             Fix For: 1.13
>
>
> We see that TIKA throws a long list of errors when extraction ppt files. We  tested with standalone tike application (1.13) we cannot reproduce the issue.
> We took a look at POI source code and abserved the class "HSLFSlideShow" we could see the below deprecated method defined 
> *
> /**
> -	 * Get the lookup from slide numbers to their offsets inside
> -	 *  _ptrData, used when adding or moving slides.
> -	 * 
> -	 * @deprecated since POI 3.11, not supported anymore
> -	 */
> -	@Deprecated
> -	public Hashtable<Integer,Integer> getSlideOffsetDataLocationsLookup() {
> -		throw new UnsupportedOperationException("PersistPtrHolder.getSlideOffsetDataLocationsLookup() is not supported since 3.12-Beta1");
> -	}
> *
> we may think Tika library still calling this deprecated method causing this run time Exception
> Caused by: org.apache.tika.exception.TikaException: Unexpected RuntimeException from org.apache.tika.parser.microsoft.OfficeParser@204c3b78
> at org.apache.tika.parser.CompositeParser.parse(CompositeParser.java:283)
> at org.apache.tika.parser.CompositeParser.parse(CompositeParser.java:281)
> at org.apache.tika.parser.AutoDetectParser.parse(AutoDetectParser.java:120)
> at com.searchtechnologies.aspire.docprocessing.extracttext.ExtractTextStage.process(ExtractTextStage.java:140)
> ... 14 more
> Caused by: java.lang.UnsupportedOperationException
> at java.util.AbstractMap$SimpleImmutableEntry.setValue(Unknown Source)
> at org.apache.poi.hslf.HSLFSlideShow.read(HSLFSlideShow.java:293)
> at org.apache.poi.hslf.HSLFSlideShow.buildRecords(HSLFSlideShow.java:273)
> at org.apache.poi.hslf.HSLFSlideShow.<init>(HSLFSlideShow.java:188)
> at org.apache.tika.parser.microsoft.HSLFExtractor.parse(HSLFExtractor.java:61)
> at org.apache.tika.parser.microsoft.OfficeParser.parse(OfficeParser.java:149)
> at org.apache.tika.parser.microsoft.OfficeParser.parse(OfficeParser.java:117)
> at org.apache.tika.parser.CompositeParser.parse(CompositeParser.java:281)
> ... 17 more



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)