You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@tika.apache.org by "Jukka Zitting (JIRA)" <ji...@apache.org> on 2008/03/28 21:28:24 UTC
[jira] Commented: (TIKA-136) Exception during command line calling
[ https://issues.apache.org/jira/browse/TIKA-136?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12583184#action_12583184 ]
Jukka Zitting commented on TIKA-136:
------------------------------------
Karl in http://markmail.org/message/ejeddz5aeeblfbw2:
> i have taken a look into it and found that the above ticket based on an missing dependency in the pom file.
>
> You have only add the following parts:
> <dependency>
> <groupId>org.fontbox</groupId>
> <artifactId>fontbox</artifactId>
> <verison>0.1.0</version>
> </dependency>
FontBox should come in as a transitive dependency from PDFBox 0.7.3, so AFAIK we don't need to explicitly add it as a dependency.
My version of the -bin packages do contain fontbox and I have no problem parsing PDF files with the Tika command line.
> Exception during command line calling
> -------------------------------------
>
> Key: TIKA-136
> URL: https://issues.apache.org/jira/browse/TIKA-136
> Project: Tika
> Issue Type: Bug
> Components: config
> Affects Versions: 0.2-incubating
> Environment: Windows XP; Java 1.5
> Reporter: Karl Heinz Marbaise
> Priority: Blocker
>
> Exception in thread "main" java.lang.NoClassDefFoundError: org/fontbox/afm/AFMParser
> at org.pdfbox.pdmodel.font.PDFont.getAFM(PDFont.java:350)
> at org.pdfbox.pdmodel.font.PDFont.getAverageFontWidthFromAFMFile(PDFont.java:313)
> at org.pdfbox.pdmodel.font.PDSimpleFont.getAverageFontWidth(PDSimpleFont.java:231)
> at org.pdfbox.util.PDFStreamEngine.showString(PDFStreamEngine.java:276)
> at org.pdfbox.util.operator.ShowTextGlyph.process(ShowTextGlyph.java:80)
> at org.pdfbox.util.PDFStreamEngine.processOperator(PDFStreamEngine.java:452)
> at org.pdfbox.util.PDFStreamEngine.processSubStream(PDFStreamEngine.java:215)
> at org.pdfbox.util.PDFStreamEngine.processStream(PDFStreamEngine.java:174)
> at org.pdfbox.util.PDFTextStripper.processPage(PDFTextStripper.java:336)
> at org.pdfbox.util.PDFTextStripper.processPages(PDFTextStripper.java:259
> )
> at org.pdfbox.util.PDFTextStripper.writeText(PDFTextStripper.java:216)
> at org.pdfbox.util.PDFTextStripper.getText(PDFTextStripper.java:149)
> at org.apache.tika.parser.pdf.PDF2XHTML.process(PDF2XHTML.java:53)
> at org.apache.tika.parser.pdf.PDFParser.parse(PDFParser.java:69)
> at org.apache.tika.parser.AutoDetectParser.parse(AutoDetectParser.java:8
> 4)
> at org.apache.tika.cli.TikaCLI.process(TikaCLI.java:118)
> at org.apache.tika.cli.TikaCLI.main(TikaCLI.java:64)
--
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.