You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@tika.apache.org by "Luis Filipe Nassif (JIRA)" <ji...@apache.org> on 2016/12/01 00:46:58 UTC

[jira] [Commented] (TIKA-2187) Align default behavior of experimental docx parser with that of doc parser in handling delText

    [ https://issues.apache.org/jira/browse/TIKA-2187?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15710370#comment-15710370 ] 

Luis Filipe Nassif commented on TIKA-2187:
------------------------------------------

Thank you [~tallison@apache.org] for making it configurable!!!

> Align default behavior of experimental docx parser with that of doc parser in handling delText
> ----------------------------------------------------------------------------------------------
>
>                 Key: TIKA-2187
>                 URL: https://issues.apache.org/jira/browse/TIKA-2187
>             Project: Tika
>          Issue Type: Improvement
>            Reporter: Tim Allison
>            Priority: Minor
>             Fix For: 2.0, 1.15
>
>
> Now that we can ignore delText via the experimental alternate SAXParser for .docx files, let's make that the default behavior to align with the expected behavior for our .doc parser (ignore deleted text).
> Let's also add the ability to include deleted text from .doc files.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)