Posted to dev@avro.apache.org by "ASF GitHub Bot (Jira)" <ji...@apache.org> on 2022/08/04 16:06:00 UTC

[jira] [Updated] (AVRO-3594) FsInput to use openFile() API for cloud storage read performance

     [ https://issues.apache.org/jira/browse/AVRO-3594?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

ASF GitHub Bot updated AVRO-3594:
---------------------------------
    Labels: pull-request-available  (was: )

> FsInput to use openFile() API for cloud storage read performance
> ----------------------------------------------------------------
>
>                 Key: AVRO-3594
>                 URL: https://issues.apache.org/jira/browse/AVRO-3594
>             Project: Apache Avro
>          Issue Type: Improvement
>          Components: java
>    Affects Versions: 1.11.2
>            Reporter: Steve Loughran
>            Priority: Minor
>              Labels: pull-request-available
>          Time Spent: 10m
>  Remaining Estimate: 0h
>
> Avro can now use the FileSystem.openFile() API (HADOOP-15229) to open a file on a Hadoop filesystem connector.
> By setting the file length and fadvise policy through opt() calls, clients can
> * skip a HEAD request when opening a file
> * optimise the ranges of GET requests for sequential access, even in clusters where s3a has been configured to use random IO (which some Hive clusters do)
> Filesystems/releases which don't recognise the options added in HADOOP-16202 will ignore them; the API falls back to the classic open(path) call if the connector doesn't have a custom implementation.
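
For illustration only, here is a minimal sketch of the kind of call the description refers to. It is not the code from the pull request. The class and method names (OpenFileSketch, openSequential) are hypothetical, and the two option keys are written as string literals on the assumption that they are the standard names introduced by HADOOP-16202. It also assumes hadoop-common is on the classpath and that the caller already knows the file length, for example from an earlier directory listing.

```java
import java.io.IOException;
import java.io.InterruptedIOException;
import java.util.concurrent.ExecutionException;

import org.apache.hadoop.fs.FSDataInputStream;
import org.apache.hadoop.fs.FileSystem;
import org.apache.hadoop.fs.Path;

/** Hypothetical helper; not part of Avro or Hadoop. */
public final class OpenFileSketch {

  // Assumed option keys (HADOOP-16202). String literals are used so the
  // sketch also compiles against releases that lack the named constants.
  static final String OPT_LENGTH = "fs.option.openfile.length";
  static final String OPT_READ_POLICY = "fs.option.openfile.read.policy";

  private OpenFileSketch() {
  }

  /**
   * Open a file for sequential reading, passing the known length as a hint.
   * Both options are set with opt(), so they are optional: a connector that
   * does not recognise them is expected to ignore them.
   */
  static FSDataInputStream openSequential(FileSystem fs, Path path, long length)
      throws IOException {
    try {
      return fs.openFile(path)
          .opt(OPT_LENGTH, Long.toString(length)) // may let the store skip a HEAD
          .opt(OPT_READ_POLICY, "sequential")     // request sequential-read behaviour
          .build()                                // CompletableFuture<FSDataInputStream>
          .get();                                 // block until the stream is open
    } catch (InterruptedException e) {
      Thread.currentThread().interrupt();
      throw new InterruptedIOException(e.toString());
    } catch (ExecutionException e) {
      // Unwrap the failure raised while opening the file asynchronously.
      Throwable cause = e.getCause();
      if (cause instanceof IOException) {
        throw (IOException) cause;
      }
      throw new IOException(cause);
    }
  }
}
```

A caller would use the returned stream exactly as it would one returned by open(path). Using opt() rather than must() is what keeps the call usable on filesystems and releases that do not know these options.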



--
This message was sent by Atlassian Jira
(v8.20.10#820010)