You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@drill.apache.org by "PJ Fanning (Jira)" <ji...@apache.org> on 2021/12/27 13:28:00 UTC

[jira] [Updated] (DRILL-8095) format-excel reader should ignore cell styles

     [ https://issues.apache.org/jira/browse/DRILL-8095?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

PJ Fanning updated DRILL-8095:
------------------------------
    Description: 
I've recently added a feature to excel-streaming-reader (in v3.3.0) to optionally ignore cell style information. This is not enabled by default. It saves memory and processing time to ignore the cell styles.

The current Drill format-excel code does not use the cell styles.

At some point in the future, it may be worth having a Drill feature that allows it to infer the schema for the sheet based on the cell styles but until such a feature is added, the parsing the cell styles is a waste of compute resources.

If this sounds, useful, I can submit a PR.

  was:
I've recently added a feature to excel-streaming-reader (in v3.3.0) to optionally ignore cell style information. This is not enabled by default. It saves memory and processing time to ignore the cell styles.

The current Drill format-excel code does not use the cell styles.

At some point in the future, it may be worth having a Drill feature that allows it to infer the schema for the sheet based on the cell styles but until such a feature is added, the parsing the cell styles is a waste of compute resources.


> format-excel reader should ignore cell styles
> ---------------------------------------------
>
>                 Key: DRILL-8095
>                 URL: https://issues.apache.org/jira/browse/DRILL-8095
>             Project: Apache Drill
>          Issue Type: Improvement
>          Components: Execution - Data Types
>            Reporter: PJ Fanning
>            Priority: Major
>
> I've recently added a feature to excel-streaming-reader (in v3.3.0) to optionally ignore cell style information. This is not enabled by default. It saves memory and processing time to ignore the cell styles.
> The current Drill format-excel code does not use the cell styles.
> At some point in the future, it may be worth having a Drill feature that allows it to infer the schema for the sheet based on the cell styles but until such a feature is added, the parsing the cell styles is a waste of compute resources.
> If this sounds, useful, I can submit a PR.



--
This message was sent by Atlassian Jira
(v8.20.1#820001)