You are viewing a plain text version of this content. The canonical link for it is here.
Posted to user@tika.apache.org by Li Leon <le...@gmail.com> on 2009/12/04 09:08:58 UTC

Issue filtering .rtf file with tika-app-0.4.exe

Hi all,


I converted the package to "tika-app-0.4.*exe*" using ikvmc.exe, so that I
could use Tika in .Net environment.

Everything worked fine so far except with .rtf file. The output(only
contains "this is rtf file" and attached) seemed to not only include the
pure content but also font and some other information. Has anyone
experienced this before?


*The code I used:*

Metadata metadata = new Metadata();

metadata.set(TikaMetadataKeys.__Fields.RESOURCE_NAME_KEY,

f.getCanonicalPath());

InputStream fis = new FileInputStream(f);

org.apache.tika.parser.Parser parser = new AutoDetectParser();

ContentHandler handler = new BodyContentHandler();





Thanks,