You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@parquet.apache.org by "Julien Le Dem (JIRA)" <ji...@apache.org> on 2015/04/03 01:26:52 UTC

[jira] [Commented] (PARQUET-226) Introduce an interface for controlling the encoding per column

    [ https://issues.apache.org/jira/browse/PARQUET-226?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14393718#comment-14393718 ] 

Julien Le Dem commented on PARQUET-226:
---------------------------------------

Enabling more precise configuration of encodings per column is a useful feature.
Could you describe how you want to control the encoding?
something like ```parquet.column.encoding.column_path=ENCODING``` ?

> Introduce an interface for controlling the encoding per column
> --------------------------------------------------------------
>
>                 Key: PARQUET-226
>                 URL: https://issues.apache.org/jira/browse/PARQUET-226
>             Project: Parquet
>          Issue Type: Improvement
>          Components: parquet-mr
>            Reporter: Boris Peltekov
>   Original Estimate: 72h
>  Remaining Estimate: 72h
>
> There are use-cases in which it is advantageous to have control over the type of encoding used for a given column. To achieve this, the ParquetProperties class must be refactored to facilitate its subclassing. The constructors of ParquetWriter and (probably) InternalParquetRecordWriter have to be modified as well. All the internal parquet classes have to refrain from directly instantiating ParquetProperties and should accept the instance from outside. 
> Currently me and my team are using a good amount of reflection and unenforceable assumptions in order to gain control of this behavior. I'd like to implement the interface change. Are you interested in this feature?



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)