You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@tika.apache.org by Paul Jakubik <pa...@purediscovery.com> on 2010/11/17 22:25:49 UTC
Supported Document Format web page out of date
Hi,
I was looking at http://tika.apache.org/0.8/formats.html and found several
issues with it:
- Says that it lists the formats supported by Tika 0.6 instead of 0.8.
- Says that it has links to parser class javadocs when it doesn't.
- Though the page promises that the parser class java docs have more
detailed information about each document format and how it is parsed, the
two I looked at, OOXMLParser and OfficeParser, had no details in their
javadoc.
Paul