You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@tika.apache.org by "Martijn van Groningen (JIRA)" <ji...@apache.org> on 2011/01/17 22:13:45 UTC
[jira] Updated: (TIKA-586) Parsing a ms access file (*.mdb) throws
an error
[ https://issues.apache.org/jira/browse/TIKA-586?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Martijn van Groningen updated TIKA-586:
---------------------------------------
Attachment: true-font.ttf
accessdb.mdb
TIKA-586.patch
The attached patch adds mime type element for a mdb file. I also updated the MimeDetectionTest. The attached test files should be put in the test resource's directory under /org/apache/tika/mime
> Parsing a ms access file (*.mdb) throws an error
> ------------------------------------------------
>
> Key: TIKA-586
> URL: https://issues.apache.org/jira/browse/TIKA-586
> Project: Tika
> Issue Type: Bug
> Components: mime
> Affects Versions: 0.8
> Reporter: Martijn van Groningen
> Priority: Minor
> Fix For: 0.9
>
> Attachments: accessdb.mdb, TIKA-586.patch, true-font.ttf
>
>
> I know that parsing a ms access file (*.mdb) is not supported since there is not parser for it, but I think it should not throw an exception.
> Currently when parsing a mdb file it is being recognized as a true font file. The TrueTypeParser throws an parser specific error when encountering a mdb file.
> Stacktrace:
> Exception in thread "main" org.apache.tika.exception.TikaException: TIKA-198: Illegal IOException from org.apache.tika.parser.font.TrueTypeParser@6906daba
> at org.apache.tika.parser.CompositeParser.parse(CompositeParser.java:203)
> at org.apache.tika.parser.CompositeParser.parse(CompositeParser.java:197)
> at org.apache.tika.parser.AutoDetectParser.parse(AutoDetectParser.java:135)
> at org.apache.tika.cli.TikaCLI$OutputType.process(TikaCLI.java:94)
> at org.apache.tika.cli.TikaCLI.process(TikaCLI.java:273)
> at org.apache.tika.cli.TikaCLI.main(TikaCLI.java:80)
> Caused by: java.io.IOException: Unexpected end of TTF stream reached
> at org.apache.fontbox.ttf.TTFDataStream.read(TTFDataStream.java:217)
> at org.apache.fontbox.ttf.TTFDataStream.readString(TTFDataStream.java:69)
> at org.apache.fontbox.ttf.TTFDataStream.readString(TTFDataStream.java:57)
> at org.apache.fontbox.ttf.AbstractTTFParser.readTableDirectory(AbstractTTFParser.java:214)
> at org.apache.fontbox.ttf.AbstractTTFParser.parseTTF(AbstractTTFParser.java:85)
> at org.apache.fontbox.ttf.TTFParser.parseTTF(TTFParser.java:26)
> at org.apache.fontbox.ttf.AbstractTTFParser.parseTTF(AbstractTTFParser.java:66)
> at org.apache.fontbox.ttf.TTFParser.parseTTF(TTFParser.java:26)
> at org.apache.tika.parser.font.TrueTypeParser.parse(TrueTypeParser.java:63)
> at org.apache.tika.parser.CompositeParser.parse(CompositeParser.java:197)
> ... 5 more
--
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.