Posted to dev@tika.apache.org by "Dave Meikle (JIRA)" <ji...@apache.org> on 2010/01/08 17:45:09 UTC

[jira] Created: (TIKA-360) Outstanding Improvements to Number/Date Formatting in ExcelParser

Outstanding Improvements to Number/Date Formatting in ExcelParser
-----------------------------------------------------------------

                 Key: TIKA-360
                 URL: https://issues.apache.org/jira/browse/TIKA-360
             Project: Tika
          Issue Type: Improvement
          Components: parser
         Environment: All Operating Systems (Seen on my Ubuntu 9.10, Solaris 10 and Windows 7 instances)
            Reporter: Dave Meikle
            Assignee: Dave Meikle


As highlighted in TIKA-103, there are issues with Tika's parsing of Excel files, caused by the way Excel stores dates as numbers and by the formatting POI applies to those numbers. To provide base support for number/date formatting, an initial patch was applied as part of TIKA-103 to use POI's out-of-the-box formatting.
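
To illustrate why dates stored as numbers are awkward to extract, here is a minimal sketch of converting an Excel serial number into a calendar date. It assumes the 1900 date system (serial 1 is 1900-01-01) and ignores the time-of-day fraction. The class and method names (ExcelSerialDate, fromSerial) are hypothetical and are not part of Tika or POI; POI ships its own conversion utilities, which this sketch deliberately avoids so that it stays self-contained.

```java
import java.time.LocalDate;

class ExcelSerialDate {

    /**
     * Converts an Excel serial number (1900 date system, an assumption of
     * this sketch) to a calendar date. The fractional part, which encodes
     * the time of day, is discarded. Serials below 1 are not handled.
     */
    static LocalDate fromSerial(double serial) {
        long days = (long) Math.floor(serial);
        // Excel treats 1900 as a leap year, so serial 60 stands for a
        // nonexistent 1900-02-29. Serials above 60 are therefore one day
        // "ahead", which the earlier epoch compensates for. Serial 60
        // itself has no real date; this sketch maps it to 1900-02-28.
        LocalDate epoch = days < 60
                ? LocalDate.of(1899, 12, 31)
                : LocalDate.of(1899, 12, 30);
        return epoch.plusDays(days);
    }

    public static void main(String[] args) {
        // The same cell value is either a plain number or a date,
        // depending only on the cell's format.
        System.out.println(fromSerial(40186.0)); // 2010-01-08
    }
}
```

Without the cell's format, the raw value 40186 cannot be told apart from an ordinary number, which is why the formatting applied to the cell matters for text extraction.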

This issue is being raised to track progress on the outstanding formatting issues, such as fractions, within the POI library.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


[jira] Commented: (TIKA-360) Outstanding Improvements to Number/Date Formatting in ExcelParser

Posted by "Dave Meikle (JIRA)" <ji...@apache.org>.
    [ https://issues.apache.org/jira/browse/TIKA-360?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12798064#action_12798064 ] 

Dave Meikle commented on TIKA-360:
----------------------------------

I am currently working on fractions support within POI under Bugzilla ID 45678 [1]. I will look at other formatting issues once this is complete.

Cheers,
Dave

[1] https://issues.apache.org/bugzilla/show_bug.cgi?id=45678
