You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@tika.apache.org by "Mike Rodent (JIRA)" <ji...@apache.org> on 2017/02/12 15:01:41 UTC
[jira] [Created] (TIKA-2264) Better handling of footnotes/endnotes
for ODF files
Mike Rodent created TIKA-2264:
---------------------------------
Summary: Better handling of footnotes/endnotes for ODF files
Key: TIKA-2264
URL: https://issues.apache.org/jira/browse/TIKA-2264
Project: Tika
Issue Type: Improvement
Components: parser
Affects Versions: 1.14
Environment: N/A
Reporter: Mike Rodent
Priority: Minor
Springs from my question here (http://stackoverflow.com/questions/42031237/modify-apache-tika-parsing-of-old-1997-2003-ms-word-docs) ... I have improve the class OpenDocumentContentParser so that it puts footnotes/endnotes at the end of the line to which they belong and doesn't break up the line in question. As with .docx parsing the notes can be linked to the reference easily.
--
This message was sent by Atlassian JIRA
(v6.3.15#6346)