You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@tika.apache.org by "Tim Allison (Jira)" <ji...@apache.org> on 2023/10/16 15:26:01 UTC

[jira] [Commented] (TIKA-4159) Boolean values and error msgs in xlsx

    [ https://issues.apache.org/jira/browse/TIKA-4159?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17775812#comment-17775812 ] 

Tim Allison commented on TIKA-4159:
-----------------------------------

This is what we get for xls in both 5.2.3 and 5.2.4:
{noformat}
<table><tbody><tr>	<td>0</td></tr>
<tr>	<td>0</td></tr>
<tr>	<td>0</td></tr>
<tr>	<td>0</td>	<td>this is some text</td></tr>
<tr>	<td>0</td></tr>
<tr>	<td>0</td></tr>
<tr>	<td>0</td></tr>
<tr>	<td>0</td></tr>
<tr>	<td>0</td></tr>
{noformat}

> Boolean values and error msgs in xlsx
> -------------------------------------
>
>                 Key: TIKA-4159
>                 URL: https://issues.apache.org/jira/browse/TIKA-4159
>             Project: Tika
>          Issue Type: Task
>            Reporter: Tim Allison
>            Priority: Minor
>         Attachments: testEXCEL_boolean_and_errors.xlsx
>
>
> [~tilman] noted a large change in the handling of boolean values and error messages in the latest regression tests in prep for the 2.9.1 release.  The diffs in values were caused by changes between POI 5.2.3 and 5.2.4. I should not have dismissed Tilman's finding on the list as "already known." This is a problem.  Thank you!
> This is 5.2.3:
> {noformat}
> <table><tbody><tr>    <td>FALSE</td></tr>
> <tr>    <td>TRUE</td></tr>
> <tr>    <td>ERROR:#DIV/0!</td></tr>
> <tr>    <td>ERROR:#VALUE!</td>    <td>this is some text</td></tr>
> <tr>    <td>ERROR:#DIV/0!</td></tr>
> <tr>    <td>0</td></tr>
> <tr>    <td>ERROR:#NAME?</td></tr>
> <tr>    <td>ERROR:#VALUE!</td></tr>
> <tr>    <td>ERROR:#VALUE!</td></tr>
> </tbody></table> {noformat}
> This is 5.2.4
> {noformat}
> <table><tbody><tr>	<td>0</td></tr>
> <tr>	<td>1</td></tr>
> <tr>	<td>#DIV/0!</td></tr>
> <tr>	<td>#VALUE!</td>	<td>this is some text</td></tr>
> <tr>	<td>#DIV/0!</td></tr>
> <tr>	<td>0</td></tr>
> <tr>	<td>#NAME?</td></tr>
> <tr>	<td>#VALUE!</td></tr>
> <tr>	<td>#VALUE!</td></tr> {noformat}
>  
> My feeling is that we should definitely try to fix the booleans.  I'm not as sure about re-inserting the ERROR: .
>  



--
This message was sent by Atlassian Jira
(v8.20.10#820010)