You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@drill.apache.org by "Paul Rogers (JIRA)" <ji...@apache.org> on 2017/02/22 19:53:44 UTC

[jira] [Created] (DRILL-5291) Parquet Reader produces low density batches - variable width fields

Paul Rogers created DRILL-5291:
----------------------------------

             Summary: Parquet Reader produces low density batches - variable width fields
                 Key: DRILL-5291
                 URL: https://issues.apache.org/jira/browse/DRILL-5291
             Project: Apache Drill
          Issue Type: Bug
            Reporter: Paul Rogers
             Fix For: 1.8.0


See DRILL-5266 for background. That JIRA analyzed the problem with Parquet producing "low density" record batches. That JIRA focused on the issue with fixed-width fields: due to a bug, we overestimated the space taken.

Once that bug is fixed, Parquet continues to produce low density batches for variable-width fields. DRILL-5266 explains why.

This ticket covers the variable-width case so that we don't lose sight of it once the fixed-width case is fixed.



--
This message was sent by Atlassian JIRA
(v6.3.15#6346)