You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@beam.apache.org by "Beam JIRA Bot (Jira)" <ji...@apache.org> on 2020/09/13 17:08:03 UTC

[jira] [Commented] (BEAM-10159) Support Reading data from Databricks Delta

    [ https://issues.apache.org/jira/browse/BEAM-10159?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17195083#comment-17195083 ] 

Beam JIRA Bot commented on BEAM-10159:
--------------------------------------

This issue was marked "stale-P2" and has not received a public comment in 14 days. It is now automatically moved to P3. If you are still affected by it, you can comment and move it back to P2.

> Support Reading data from Databricks Delta
> ------------------------------------------
>
>                 Key: BEAM-10159
>                 URL: https://issues.apache.org/jira/browse/BEAM-10159
>             Project: Beam
>          Issue Type: New Feature
>          Components: io-ideas
>            Reporter: Ismaël Mejía
>            Priority: P3
>
> Databricks Delta is an open source storage layer on top of different filesystems. The current implementation of Delta is strongly coupled with Spark so we cannot rely on it because it would break Beam portability.
> However now there is an open specification for Delta's protocol.
> https://github.com/delta-io/delta/blob/master/PROTOCOL.md
> Another possible approach could be to investigate how if Beam could use a manifest based approach like Presto does:
> https://docs.databricks.com/delta/presto-integration.html



--
This message was sent by Atlassian Jira
(v8.3.4#803005)