You are viewing a plain text version of this content. The canonical link for it is here.
Posted to commits@beam.apache.org by "Neville Li (JIRA)" <ji...@apache.org> on 2016/06/23 17:05:16 UTC
[jira] [Updated] (BEAM-371) Backport HDFS IO enhancements from Scio
[ https://issues.apache.org/jira/browse/BEAM-371?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Neville Li updated BEAM-371:
----------------------------
Description:
Right now there is a {{beam-sdks-java-io-hdfs}} module but only {{HDFSFileSource}} is implemented and there's a known issue with reading Avro files.
https://github.com/GoogleCloudPlatform/DataflowJavaSDK/issues/102
We at Spotify have implemented HDFS sinks, specialized source/sink for Avro and simple authentication and would like to port it back to Beam.
https://github.com/apache/incubator-beam/pull/485
was:
Right now there is a {{beam-sdks-java-io-hdfs}} module but only {{HDFSFileSource}} is implemented and there's a known issue with reading Avro files.
https://github.com/GoogleCloudPlatform/DataflowJavaSDK/issues/102
We at Spotify have implemented HDFS sinks, specialized source/sink for Avro and simple authentication and would like to port it back to Beam.
> Backport HDFS IO enhancements from Scio
> ---------------------------------------
>
> Key: BEAM-371
> URL: https://issues.apache.org/jira/browse/BEAM-371
> Project: Beam
> Issue Type: Improvement
> Components: sdk-java-extensions
> Affects Versions: 0.1.0-incubating
> Reporter: Neville Li
> Assignee: James Malone
> Priority: Minor
>
> Right now there is a {{beam-sdks-java-io-hdfs}} module but only {{HDFSFileSource}} is implemented and there's a known issue with reading Avro files.
> https://github.com/GoogleCloudPlatform/DataflowJavaSDK/issues/102
> We at Spotify have implemented HDFS sinks, specialized source/sink for Avro and simple authentication and would like to port it back to Beam.
> https://github.com/apache/incubator-beam/pull/485
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)