You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@hive.apache.org by "Panagiotis Garefalakis (Jira)" <ji...@apache.org> on 2021/05/11 15:19:00 UTC

[jira] [Updated] (HIVE-25103) Update row.serde excludes defaults

     [ https://issues.apache.org/jira/browse/HIVE-25103?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Panagiotis Garefalakis updated HIVE-25103:
------------------------------------------
    Description: 
HIVE-16222 introduced row.serde.inputformat.excludes setting to disable row.serde for specific NON-Vectorized formats.
Since MapredParquetInputFormat is currently natively vectorized it should be removed from that list.

Even when hive.vectorized.use.vectorized.input.format is DISABLED
Vectorizer will not vectorize in row deserialize mode if the input format has is natively Vectorized so it is safe to remove.

Conf order to control vectorization:

1. hive.vectorized.use.vectorized.input.format
2. hive.vectorized.use.vector.serde.deserialize
3. hive.vectorized.use.row.serde.deserialize

  was:
HIVE-16222 introduced row.serde.inputformat.excludes setting to disable row.serde for specific NON-Vectorized formats.
Since MapredParquetInputFormat is currently natively vectorized it should be removed from that list.

Even when hive.vectorized.use.vectorized.input.format is DISABLED
Vectorizer will not vectorize in row deserialize mode if the input format has is natively Vectorized so it is safe to remove.

There is a precedence/order to control vectorization:

1. hive.vectorized.use.vectorized.input.format
2. hive.vectorized.use.vector.serde.deserialize
3. hive.vectorized.use.row.serde.deserialize


> Update row.serde excludes defaults
> ----------------------------------
>
>                 Key: HIVE-25103
>                 URL: https://issues.apache.org/jira/browse/HIVE-25103
>             Project: Hive
>          Issue Type: Improvement
>            Reporter: Panagiotis Garefalakis
>            Assignee: Panagiotis Garefalakis
>            Priority: Major
>
> HIVE-16222 introduced row.serde.inputformat.excludes setting to disable row.serde for specific NON-Vectorized formats.
> Since MapredParquetInputFormat is currently natively vectorized it should be removed from that list.
> Even when hive.vectorized.use.vectorized.input.format is DISABLED
> Vectorizer will not vectorize in row deserialize mode if the input format has is natively Vectorized so it is safe to remove.
> Conf order to control vectorization:
> 1. hive.vectorized.use.vectorized.input.format
> 2. hive.vectorized.use.vector.serde.deserialize
> 3. hive.vectorized.use.row.serde.deserialize



--
This message was sent by Atlassian Jira
(v8.3.4#803005)