You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@tika.apache.org by "Serban Alexe (JIRA)" <ji...@apache.org> on 2018/02/01 11:25:00 UTC
[jira] [Updated] (TIKA-2555) Text with [underline] + [another
format] in word document generates overlapping html tags.
[ https://issues.apache.org/jira/browse/TIKA-2555?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Serban Alexe updated TIKA-2555:
-------------------------------
Priority: Minor (was: Major)
> Text with [underline] + [another format] in word document generates overlapping html tags.
> ------------------------------------------------------------------------------------------
>
> Key: TIKA-2555
> URL: https://issues.apache.org/jira/browse/TIKA-2555
> Project: Tika
> Issue Type: Bug
> Affects Versions: 1.17
> Reporter: Serban Alexe
> Priority: Minor
> Attachments: Clipboard02.jpg
>
>
> I have a sample _.docx_ document which contains one single line of text**++.
> Making that text to be:
> * +underlined+
> ** AND at least one of the following two
> * _italic_
> * *bold*****
> will cause the generated _.xhtml_ file to contain overlapping tags.
>
> _+Example+_:
> *+The quick brown fox jumps over the lazy dog.+*
> will result in
> <b><u>The quick brown fox jumps over the lazy dog.</b></u>
> which causes some browser (Firefox, Chrome) to give an error and not display the content of the file...
>
--
This message was sent by Atlassian JIRA
(v7.6.3#76005)