You are viewing a plain text version of this content. The canonical link for it is here.
Posted to commits@beam.apache.org by "ASF GitHub Bot (JIRA)" <ji...@apache.org> on 2018/03/28 02:24:02 UTC

[jira] [Work logged] (BEAM-3437) Support schema in PCollections

     [ https://issues.apache.org/jira/browse/BEAM-3437?focusedWorklogId=85111&page=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-85111 ]

ASF GitHub Bot logged work on BEAM-3437:
----------------------------------------

                Author: ASF GitHub Bot
            Created on: 28/Mar/18 02:23
            Start Date: 28/Mar/18 02:23
    Worklog Time Spent: 10m 
      Work Description: reuvenlax opened a new pull request #4964: [BEAM-3437] Introduce Schema class, and use it in BeamSQL
URL: https://github.com/apache/beam/pull/4964
 
 
   We introduce the new Schema and Row classes. In this pull request, the classes are only used in BeamSQL. This replaces the previous BeamSQL RowType class which was a mapping from field name to coder. Nested arrays and rows are fully supported.
   
   Future PRs will add support for schemas on any PCollection.
   
   R: @akedin 

----------------------------------------------------------------
This is an automated message from the Apache Git Service.
To respond to the message, please log on GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
users@infra.apache.org


Issue Time Tracking
-------------------

            Worklog Id:     (was: 85111)
            Time Spent: 10m
    Remaining Estimate: 0h

> Support schema in PCollections
> ------------------------------
>
>                 Key: BEAM-3437
>                 URL: https://issues.apache.org/jira/browse/BEAM-3437
>             Project: Beam
>          Issue Type: Wish
>          Components: beam-model
>            Reporter: Jean-Baptiste Onofré
>            Assignee: Jean-Baptiste Onofré
>            Priority: Major
>          Time Spent: 10m
>  Remaining Estimate: 0h
>
> As discussed with some people in the team, it would be great to add schema support in {{PCollections}}. It will allow us:
> 1. To expect some data type in {{PTransforms}}
> 2. Improve some runners with additional features (I'm thinking about Spark runner with data frames for instance).
> A technical draft document has been created: 
> https://docs.google.com/document/d/1tnG2DPHZYbsomvihIpXruUmQ12pHGK0QIvXS1FOTgRc/edit?disco=AAAABhykQIs&ts=5a203b46&usp=comment_email_document
> I also started a PoC on a branch, I will update this Jira with a "discussion" PR.



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)