You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@beam.apache.org by "Beam JIRA Bot (Jira)" <ji...@apache.org> on 2020/09/10 17:08:01 UTC
[jira] [Updated] (BEAM-10103) Add WasbIO / WasbFileSystem (Azure
Blob Storage)
[ https://issues.apache.org/jira/browse/BEAM-10103?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Beam JIRA Bot updated BEAM-10103:
---------------------------------
Labels: stale-assigned (was: )
> Add WasbIO / WasbFileSystem (Azure Blob Storage)
> ------------------------------------------------
>
> Key: BEAM-10103
> URL: https://issues.apache.org/jira/browse/BEAM-10103
> Project: Beam
> Issue Type: Bug
> Components: io-ideas
> Reporter: Ashwin Ramaswami
> Assignee: Ashwin Ramaswami
> Priority: P3
> Labels: stale-assigned
>
> Azure Blob Storage can be accessed by using the wasb:// and wasbs:// protocols. This should be quite similar to the hdfs:// implementations already there.
>
> We should just be able to use it like this:
>
> {code:python}
> (
> p
> | ReadFromText("yourcontainer@youraccount.blob.core.windows.net/test/sample.txt")
> | WriteToText("output.txt", shard_name_template="", num_shards=0)
> )
> {code}
>
>
> See:
> [1] [https://docs.microsoft.com/en-us/azure/hdinsight/hdinsight-hadoop-use-blob-storage]
> [2] [https://hadoop.apache.org/docs/current/hadoop-azure/index.html]
> [3] [https://gerardnico.com/azure/wasb]
--
This message was sent by Atlassian Jira
(v8.3.4#803005)