You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@parquet.apache.org by "Mukund Thakur (Jira)" <ji...@apache.org> on 2022/08/10 22:44:00 UTC
[jira] [Created] (PARQUET-2171) Implement vectored IO in parquet file format
Mukund Thakur created PARQUET-2171:
--------------------------------------
Summary: Implement vectored IO in parquet file format
Key: PARQUET-2171
URL: https://issues.apache.org/jira/browse/PARQUET-2171
Project: Parquet
Issue Type: New Feature
Components: parquet-mr
Reporter: Mukund Thakur
We recently added a new feature called vectored IO in Hadoop for improving read performance for seek heavy readers. Spark Jobs and others which uses parquet will greatly benefit from this api. Details can be found hereĀ
[https://github.com/apache/hadoop/commit/e1842b2a749d79cbdc15c524515b9eda64c339d5]
https://issues.apache.org/jira/browse/HADOOP-18103
https://issues.apache.org/jira/browse/HADOOP-11867
--
This message was sent by Atlassian Jira
(v8.20.10#820010)